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PREFACE 


The rapidly increasing number of students beginning graduate 
work are handicapped by the lack of books in English which 
provide readable introductions to important parts of mathematics. 
Nor is the difficulty met adequately by the slow method of lecture 
courses. 

IHirpose of This book is based on the author’s lectures of recent 
this book years. It presupposes calculus and elementary theory 
of algebraic equations. Its aim is to provide a simple introduction 
to the essentials of each of the branches of modern algebra, 
with the exception of the advanced part treated in the author’s 
Algebras and Their Arithmetics. The book develops the theories 
which center around matrices, invariants, and groups, which 
are among the most important concepts in mathematics. 

It is a text for The book provides adequate introductory courses 
several courses higher algebra, (ii) the Galois theory of 

algebraic equations, (iii) finite linear groups, including Klein’s 
“icosahedron” and theory of equations of the fifth degree, and 
(iv) algebraic invariants. 

Higher The subject known in America as higher algebra is 
algebra treated fully in Chapters III-VI; it includes matrices, 
linear transformations, elementary divisors and invariant factors, 
and quadratic, bilinear, and Hermitian forms, whether taken singly 
or in pairs. While the results are classic, the presentation is new 
and particularly elementary. Due attention is given to ques- 
tions of rationality, which are too often ignored. The imified 
treatment of Hermitian and quadratic forms requires but little 
more space than would be needed for quadratic forms alone. 
Elementary divisors and invariant factors are introduced in 
Chapter V in a simple, natural way in connection with the classic 
and a new rational canonical form of linear transformations; this 
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treatment is not only more elementary than the usual one, but 
develops these topics in close connection with their most frequent 
applications. It is then a simple matter to deduce in Chapter VI 
the theory of the equivalence of pairs of bilinear, quadratic, or 
Hermitian forms. We thereby avoid the extraneous topic of 
matrices whose elements are polynomials in a variable and the 
'^elementary transformations’^ of them. 

Algebraic Is every equation solvable by radicals? This question 
equations jg Qf absorbing interest in the history of mathe- 
matics. It was finally answered in the negative by means of 
groups of substitutions or permutations of letters. The usual 
presentation of group theory makes the subject quite abstruse. 
This impression is avoided in the exposition in Chapters VII-XI. 
Substitutions are introduced in a very deliberate and natural 
way in connection with the solution of cubic and quartic equa- 
tions. The reader will therefore appreciate from the start some 
of the reasons why substitutions and groups are employed. 
Fortunately we are able to alternate theory and application in the 
further exposition of groups. The theory gives very simple 
answers to the following questions: Can every angle be trisected 
with ruler and compasses? What regular polygons can be con- 
structed by elementary geometry? 

Icosahedron, Klein’s book on the icosahedron and equations of 
linear groups degree is a classic, but causes real diffi- 

culties to beginners on account of the inclusion of ideas from 
many branches of higher mathematics. Chapter XIII gives a 
simple exposition of the essentials of this interesting theory, 
which is a prerequisite to the subjects of elliptic modular func- 
tions and automorphic functions. The preliminary Chapter XII 
discusses the removal of several terms from any equation by 
means of a rational (Tschirnhaus) transformation, and the reduc- 
tion of the general equation of the fifth degree to Brioschi’s normal 
form, which is weU adapted to solution by elliptic functions. The 
final Chapter XIV is a sequel to Klein’s theory. It establishes 
remarkable results on the representation of a given group as a 



PREFACE 


V 


linear group, and gives an introduction to Frobenius^s theory of 
group characters. The latter is an effective tool for finite groups 
and has led to important new results. 

Algebraic Chapters I and II provide an easy introduction to the 
invariants important subject of invariants. Hessians and Jaco- 
bians are shown to be covariants and applied to the determina- 
tion of canonical forms of binary cubic and quartic forms, as 
well as to the solution of cubic and quartic equations. Every 
seminvariant is proved to be the leading coefficient of one and 
only one covariant. It is shown that all covariants of any 
system of binary forms are expressible in terms of a finite number 
of the covariants. Such a fxmdamental system of covariants is 
actually found for one form of each of the orders 1, 2, 3, 4. Valu- 
able supplementary work on invariants is provided by Chapter 
XIII, which presupposes the concept of groups of substitutions 
explained in the elementary Chapter VII. 

There are numerous sets of simple problems, and a few historical 
notes. On pages 38, 133, 176, 203, and 249 there are lists of topics 
for further reading, with references to writings in English. These 
topics are suitable for assignment to students for full reports at the 
end of the particular course. 

L. E. Dickson 

University of Chicago 
March 16, 1926 
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MODERN ALGEBRAIC THEORIES 


Chapter I 

INTRODUCTION TO ALGEBRAIC INVARIANTS 

Invariants and covariants play an important r&le in the various 
parts of modern algebra as well as in geometry. The elementary 
theory presented in this chapter will meet the ordinary needs in 
other parts of mathematics. It covers rather fully the subject of 
invariants and covariants of a homogeneous pol 3 niomial in x and 
y of degree < 5, with application to the solution of cubic and 
quartic equations. When the degree exceeds 4, most of the cova- 
riants are too long to be of real use, unless their symbolic repre- 
sentation is employed. 

1. Linear transformations. When two pairs of variables a;, y 
and 7j are connected by relations of the form 

X = + brjj y = + d% D = ^ ^ 0, 

these relations define a linear transformation T of determinant D. 
Consider another linear transformation U defined by 

^ = eX+fY, v = gX + hY, 

The equations obtained by eliminating ^ and rj are of the form 
X “ kX + lY, y = mX + tiF, 

and define a linear transformation P which is called the product 
of T and 17, taken in that order, and is denoted by T{7. The 
values of the coefficients are 
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k = ae + hg, I = af + bhj m = ce + dgj n = c/ + dh. 

The determinant of P is found to be equal to PA, and hence is 
not zero. If we solve the equations of T and in the result replace 
xhj X and y by F, we get 

? = P-i dX - P-i 6F, 77 = -- P-^ cX + P-i aF. 

These equations define a transformation called the inverse of T 
and denoted by Since the variables of are the same as 
those of U, the product TT~^ is found by eliminating f and rj 
and hence is x = X, y = Y, The latter is called the identity 
transformation. It is readily verified that also T is the identity 
transformation. 

We shall next prove the associative law 
TU-V = T • UV, 

which allows us to write TUV for either product. Let 
X pu + qvj Y = ru + sv 

be the equations of V. The product TU • V is found by elim- 
inating first Tj and then X, F between the equations of T, P, F, 
while the product T • UV is obtained by eliminating first X, F 
and then r} between those equations. In each case we must 
evidently obtain the same equations expressing x and y as linear 
functions of u and v. 

2, Fonns and their classification. A polynomial like 
ax® + bx^y + cxyz + dxz^^ 

every term of which is of the same total order (here 3 ) in the vari- 
ables X, y, z, is called homogeneous in x, y, z. A homogeneous 
polynomial is called deform. According as the number of variables 
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is 1, 2, 3, , or g, the form is called unary ^ binary, ternary, 

quaternary, . . . , or q~ary, respectively. According as the order of 
the form is 1, 2, 3, 4, , p, it is called linear, quadratic, cubic, 
quartic, . . . , p-ic, respectively. 

For example, the polynomial displayed above is a ternary cubic 
form, while ax^ + bxy + cy^ is a binary quadratic form. 


3. Hessians. The Hessian (named after Otto Hesse) of a func- 
tion /(a;, y) ot two variables is the determinant 



av 

dx^ 

dxdy 

d^f 

av 

dydx 

dy^ 


whose elements are second partial derivatives. For example, the 
Hessian of / = ax^ + 2bxy + cy*^ is 4:(ac — b^). 

The Hessian h of f{xi, ,,, ,Xq) is the determinant of order q in 
which the elements of the ith row are 

(2) ^ dV ^ d^f 

dX^ dXi dx^ dX2 dXt dXq 

Let / become F(yi, . . . , 2/3) under the hnear transformation 

(3) = Cti 2/1 + * • • + Ctg 2/« I 7 • • • > Q.)} 


whose determinant is 



Cql * * ' O33 


The product AA is a determinant of order q whose element in 
the ith row and jth column is the sum of the products of the 
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elements (2) of the ^’th row of h by the corresponding elements 
Cl 3, C2,, . . . , Cg; of the jth column of A and hence is equal to the 
partial derivative with respect to a; i of 


( 4 ) 


— Cl, + — C2,- + 
dxi 0X2 


+ ■^0 • 
^ dx/^’ 


dxi dyj dx, dyj dy,- 


Let A' denote the determinant obtained from A by interchang- 
ing its rows and columns, whence A' = A. In the product A' ■ h A, 
the element in the rth row and jth column is therefore 

d dF d dF d dF 

Clr ~ r !-•••+ Cjr r 7 = ~ ~ ’ 

dXi dyj dXg By, Byr Byj 


since is the partial derivative of a;,- in (3) with respect to yr. 
Hence 


An = 


Bn 


dyr Byj 


rj 


J— 1> 


• > Q 


Hessian of F. 


Theorem 1. If F is the function oUained from f by applying 
any linear transformation of determinant A, the Hessian of F is 
equal to the product of the Hessian of f by A^. 

4. Definition of invariants and covariants. Let a transforma- 
tion T of determinant A, expressing x and y linearly in terms of 
^ and ^7, replace f = ax^ + 2bxy •+* cy^ by + 2B^ri + Cif. We 
saw in §3 that the Hessian of / is the product of the discriminant 
ac — 6^ of / by 4, By Theorem 1, 

AC - = AHac - 62). 


We therefore call the discriminant ac — ¥ an invariant of index 2 
of /. We shall generalize this definition. 
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Consider the general binary form of order p, 

f = aoxP + ai 2/ + b 2/^. 

Let a transformation T replace / by 

F = Ao + Ai rj + ^ ‘ + Ap 7 ]P. 

A polynomial I (ao, . . . , a^) is called an invariant of index I of f 
if, for every transformation T of determinant A 0, 

I (Aq, . . . , Ap ) JT (<Xo, . . . , ctp ), 

identically in ao, . . . , after the A’s have been replaced by their 
values in terms of the a’s. 

Covariants K are defined similarly. If, for every transformation 
T of determinant A 0, a polynomial K in the coefficients and 
variables of / has the property that 

K(Ao, . . . , Aj,; 97) = A^Z(ao, . . . , ap]X,y), 

identically in Uo, . . . , a^, 77, after the A^s have been replaced 

by their values in terms of the a^s, and after x and y have been 
replaced by their values in terms of f and 77 from T, then K is 
called a covariant of index I of /. 

By Theorem 1, the Hessian of / is a co variant of index 2 of /. 
Note that / itself is a covariant of index zero of / since F = A°/- 
By a covariant of index Z of a system of forms /i, . . . is 
meant a function of their coefficients and variables a^i, . . . ,a;g 
whose product by AMs equal to the same function of the corre- 
sponding coefficients and variables ?/i, . . . ? 2/3 forms 

Fij...jFk derived from /i, by applying the general 
linear transformation (3) whose determinant A is not zero. An 
example is given by Theorem 2 below. 

A covariant which does not involve any of the variables is an 
invariant. 
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5. Jacobians, Th.^ functional determinant or Jacobian (named 
after C. G. J. Jacobi) of q functions f\{xu . . . , . . . jfqixi, 

, . , ,Xq) with respect to the variables Xi, . . . , is defined to be 
the determinant 



dfi 

dfi 

Bxx 

!? 

<15 

dXq 


dfa 

d/g 

dxi 

dX2 

dXq 


Theorem 2. The Jacobian o/ /i, . . . , fq is a covariant of index 
unity of the system of forms /i, . . . , fq. 

Under a transformation (3) of determinant A, let fi become 
Fi{yi^ * ^yq). Then (4) holds if we put the subscript i on each 
/ and jP. Hence the Jacobian oi Fi, , Fq with respect to the 


variables yu • • * 

, 2/2 is equal to 




— Cii + 

dxi 

. . . _ Cji - • ■ 

aXf 

4 - 

' + * • 
dXi 

dfi 


z' 4 - 
— On + 

dXi 

* * • + T~" ^ai ’ * 
BXq 

• Clg + • 

dxi 

^ dx, 


This determinant is the product of the determinant (5) by the 
determinant A = | ] . 


Exercises 

1. If /i, . . . ,/g are dependent functions, their Jacobian is identically zero. 
[Use/fl = gUu ■ * • 

2. Two functions / and g ol x and y are dependent if their Jacobian J is 
identically zero. 

[Unless ^ is a constant function of /, we may take dg/dx not identically 
zero and hence employ g and y as new variables in place of x and y. Then 
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do dF 

f - Fig, y)t and J reduces to — 

dx dy 


] 


3. If fix, y) is 3. binary form of order p, then (Euler) 


dx dy 


4. The Hessian of {ax -\-hy)^is identically zero. 
[It is sufficient to prove this for x^. Why?] 


5. Conversely, if the Hessian of a binary form / (a;, y) of order y is identically 
zero, f is the pth power of a linear function. 

[The Hessian of / is the Jacobian of df/dx, df/dy. The latter are depend- 
ent by Ex. 2. Thus 


h — 
dx 


dy 


where a and h are constants. Solve this with Euler^s relation in Ex. 3, find the 
derivatives of log /, and integrate.] 

6. If the Jacobian of a quadratic form / and a linear form Z in a; and y is 
identically zero, / is the product of P by a constant. 

7. If a form / in g variables is expressible as a form in fewer than q variables, 
the Hessian of / is identically zero.^ 

8. The Jacobian of any binary form / and its Hessian h is a covariant of 
index 3 of J. 

9. / = ax^ -f" 2bxy + cy^ and g = px^ -{- 2qxy + ry'^ have the invariant 
g =r ar — 2bq + cp of index 2. 

[The discriminant of f tg is an invariant for every constant Z.] 

10. A binary quadratic form / and a linear form I have a linear covariant 
J, their Jacobian. Show that the Jacobian of J and I is an invariant of index 2 
of the system / and I, 

11. For/, I, J in Ex. 10, prove that the points represented by / == 0 are 
separated harmonically by the points represented hy I = 0 and J = 0. 


6. The discriminant of a binary cubic form. Write 

(6) f(x, y) = ax^ + Sba? y + Zcxy"^ 4- dy®. 

^ The converse is true when q is less than 5, but not when q exceeds 4 (as believed 
erroneously by Hesse). See Gordan and Nother, Math. Annalen, 10, 1876, 564. 
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Its Hessian is 3Qh, where 

h = rx^ + 2sxy + ty^, r = ac — b^, 2s = ad — be, 
t = bd- c\ 

Under any linear transformation of determinant A, let / become 
(8) F = + ZBe n + + ByK 

Denote the Hessian of F by 3611. Since A is a covariant of index 2 
of/, 

H = + 2Siri + T'r,^ = d? h, 

R = AC ~ B\ 2S = AB- BC, T = BD - C\ 

Hence r, 2A2 5 , t are the coeflScients of a binary quadratic 
form which our transformation replaces by one having the coeffi- 
cients R, 2S, T. Since the discriminant of a binary quadratic form 
is an invariant of index 2, we have 

- 52 = A 2 { A 2 r-AH- (A^ 5 ) 2 } == A® {rt - s^). 

Hence^ rt — is an invariant of index 6 of /. We shall call 
(10) — 4(ri — 52 ) = (ad — bcY — 4(ac — b^) (bd — c^) 

the discriminant of the binary cubic form (6). 

To justify this definition, we shall prove 

Theorem 3. The invariant (10) is zero if and only if fix/y^ 1) 
= 0 has a multiple root, i.e., if f(x, y) is divisible by the square of a 
linear function of x and y. 

For, if the latter be the case, we can transform / into a form (8) 
having the factor then C == D = 0 and the function (10) written 
in capital letters is zero, so that the invariant (10) itself is zero. 

^TMs result and those of Exs. 8 and 10 of §5 illustrate the theorem that any 
invariant (or covariant) of one or more covariants of a system of forms is an in- 
variant (or covariant) of the forms. 
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Conversely, let (10) be zero, so that the discriminant of (7) is 
zero. Then h is the square of a linear function Apply a linear 
transformation which expresses x and y in terms of J and any 
function y independent of it. We now have (9) with h = 
Hence S = 0, T = 0. H D - 0, then C = 0 (by T = 0) and (8) 
has the factor If Z> 5=^ 0, we have 


B == 




F = D(^ + ||)3. 


In either case, / = F is divisible by a square of a linear function. 


7. Canonical form of a binary cubic form; solution of cubic 
equations. 

Theorem 4. Any binary cubic form can be transformed into 
+ F® if its discriminant is not zerOj and into Y or X^ if its 
discriminant is zero. 

Since the last statement has been proved, let the discriminant 
(10) of the form (6) be not zero. Then, if we discard the factor 
36 from all Hessians, the Hessian (7) is the product of two linearly 
independent factors tj. Hence / can be transformed into a form 
F of type (8) whose Hessian (9) reduces to whence E = 0, 
T = 0 , 87 ^ 0 , If C = 0, then B = 0 (by E = 0) and F = 

+ Drf, where AD ^ 0 (by S 9 ^ 0), Taking 

^ X, rj == 7, 


we get F = + F^, as desired. The remaining case C 9 ^ 0 is 

readily excluded; for, then B 9 ^ 0 (by T = 0 ) and 
52 C2 

ad ^ BC, S - 0 . 

C B 


Hence to solve a cubic equation c{v}) = 0 without a multiple 
root, reduce the cubic form fix^y) ^y^c{x/y) to the form 
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+ Dirf by employing the factors ^ and tj of the Hessian of /• 


Exercises 

1 . The Hessian of / = 2/ -j- 12x2/® + dy^ is 36 (d — 8) (xy + 2 ^®)* 

When its factors ^ = x + 2 ?/ and t? = 2/ are taken as new variables, / becomes 

+ (d - 8 )^ 3 . 

2 ^ For d = 9 in Ex. 1 , x/y = — 3 , — 2 — co, or — 2 — a;®, wherer^*? is an 
imaginary cube root of unity. 

3 . What are the roots when d — 7 ? 

4 . The discriminant of x® — 27 x 2 /® 4 * 542/^ is zero. 

8* Homogeneity of covariants of a system of forms. 

Theorem 5. A covariant which is not homogeneous in the vari- 
ables is a sum of covariants each homogeneous in the variables. 

Let the system of forms have the coefficients a^by . . . , arranged 
in any order. If is a covariant of the forms, 

J 5 l (A, ; 1, 17, . . Z(a, 6, . • • ; a:, I/, • . .). 

When x^yj . . , are replaced by their linear expressions in f , 
rjj ... y the terms of order ni on the right (and only terms of order 
ui) give rise to terms of order % in 17, ... on the left. Hence, if 
Ki is the sum of all the terms of order ni of K, we have 

By ... y JBil (Ofy 6 , • • • j Xy 2 ^, • • •), 

SO that iTi is a covariant of the system of forms. Similarly, the 
sum Kz of all the terms of order 7i2 of if is a covariant. Hence K 
is a sum of covariants each homogeneous in the variables. 

Theorem 6. When a covariant of two or more forms is not 
homogeneous in the coefficients of each form separately y it is a sum of 
covariants each homogeneous in the coeffikients of each form sepor 
rately. 
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The proof is entirely similar to that used in the following case. 
Employ the notations of Ex. 9 of §5 and write d — ac — W. 
Then s + Sd is an invariant of index 2 of the forms / and g. Let a 
linear transformation of determinant A replace / and g by 

F = + 2Birt + Cri\ G^Pe + 2Q^v + Rri^ 

so that Aj Bj C are linear functions of a, 6, c, while P, Q, R are 
linear functions of p, g, r. By hypothesis, 

AR - 2BQ + CP + 3(A0 - B^) = A^is + 3d). 

The terms ZdA^ of maximum degree 2 in a, 6, c on the right arise 
only fropa the part 3 (AC -r JS^) on the left. Hence d is itself an 
invariant of index 2; likewise s is itself an invariant. 

In view of Theorems 5 and 6, we may and shall henceforth 
restrict attention to covariants which are homogeneous in the 
variables (of constant order n) and homogeneous in the coefficients 
of each form separately. 

Theorem 7. Let K be a covariant of order n and index I of the 
forms fi) . • . jfrof orders pi, . . , , pr the same variables Xi, . . . , Xg. 
Let K be of degree d^ in the coefficients of /^. Then di — n 
= qL 

Let fi have the coefficients = 1, 2, . . .). Apply the trans- 
formation 


xi = myu . . . , Xg = my^ 


of determinant m®. Then fi becomes a form having the coefficients 
Cii = m^i Thus 

K (m^i Cl/, . . . , m^r Crf, Xi, . . . , xf) 

= (m^yK(cii, . . . , Cr/,* Xi, . . . , Xg), 
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identically in m, Xi, . . . , Xq, cij, . . , , Crj, where each j has its 
range of values. We may remove the factor and place the 
factor in front of the left member. Since K is of degree in 
the Cij(j = 1, 2, • . .), 

m^i^i • • * m^T^r = 

which proves the theorem. 

If, when r = 1, we do not assume that K is of constant degree 
di, but consider one term of degree di, we see that the proof gives 
Pi di — n = qL Hence di is the same for aU terms. This proves 

Theoeem 8. A covariant of constant order n of a single form f is 
of constant degree in the coefficients of /. 

9. Weights of the coefficients of a covariant of a binary form. 
It is now necessary to employ the subscript notation for the coeffi- 
cients of the form 


. . f - aoX^ + pai xp-^ 2/ + • • • 

+ ( i) Qi y* + h Op 

the binomial coefficients p, etc,, being prefixed to avoid fractions 
in the later work. The weight of a a: is defined to be k and that of a 
product of factors a & to be the sum of the weights of the factors. 

To make (6) conform with our present notation (11) for p = 3, 
replace a, 6, c, d by oo, Ui, (hj as. Then the Hessian (7) becomes 


( 12 ) 


36 (rx^ + 2sxy + ty^)j 
7* = <j(} ^2 ~ 2s ^ ao as — ai Oz, 


t — ai as — o^. 


The terms of r are of constant weight 2, those of s of weight 3, and 
those of t of weight 4. Since the Hessian is of index 2, this illus- 
trates 
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Theorem 9. In a covariant of order n and index I, 

(13) ' K =' tSria ) x -' r , 

r=o 

of a single form (11), all terms of the coefficient Sr (a) are of the 
same weight I + r. In particular, the weight of the leader So (a) of 
the covariant is equal to the index 1. 

The transformation x = y — mn replaces / by a form whose 
literal coefficients are By the definition of a covariant 

K of index I, we have 

(14) 2] Sr (A) Sr(a)x^-^y^. 

r=0 

Let t = cao^o ai^i • • • Op^p be any term of Sr (a); the weight of t 
is + 262 + • • • + pep. The left member of (14) has the 

term 

cAo^o Ai^i • • • Ap^p 7)^ = tx-^-^ y^ 

which must be equal to a term of the right member of (14). 
Hence Wr — r — 1. This proves Theorem 9. 

Each Sr(ci) is isobaric, being of constant weight. If we attribute 
the weight 1 to x and the weight 0 to y, we see that every term of 
(13 ) is of total weight I + n. 

Corollary. A homogeneous polynomial is a covariant of f with 
respect to every transformation x = y — mn, if and only if it is 
isobaric. 


Exercises 

1. Verify Theorem 9 for the covariant / of /. 

2. For an invariant of index I and degree d of one binary form of order p, 
pd = 21 Here I is also the weight of the invariant. 

3. Hence no binary form of odd order has an invariant of odd degree. 
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10. Semijavariants. A homogeneous, isobaric polynomial in 
the coefficients of a binary form / is called a seminvariant of / if it is 
an invariant of / with respect to all transformations of the type 

Tfc : a; = ^ + fctj, y v- 

Let Aqj Ai, . , , be the literal coefficients of the form obtained 
from (11) by applying transformation T*. Although not essential 
to our discussion, note that 

(15) Aq = ao, Ai = ai + kao, A 2 - (h + 2kai + F Gq. 

The first equation shows that ao is a seminvariant of /. Elimina- 
tion of k between the last two equations gives 

A 0 A 2 — Ai^ = ao 02 — Oi^, 

whence Oo 02 — Oi^ is a seminvariant of /. It is the leader of the 
Hessian of f tor p = 3, as noted before Theorem 9, Evidently Oo 
is the leader of the covariant / of /. These two cases illustrate 

Theorem 10. The leader of every covariant K of any binary 
form f is a seminvariant of /. 

The leader Sq (a) of any covariant (13) is isobaric by Theorem 9 
and is homogeneous by Theorem 8. Since K is & covariant of / 
with respect to every transformation Tk of determinant unity, 
we have 

K =j:SriA)e^-’-r =J28r(A)ix - kyy-r yr. 


A comparison of the coefficients of with that in (13) gives /So (a) 
= So (A). Hence So (a) is an invariant of / with respect to every 
transformation This completes the proof of Theorem 10. 

By Tfc, a; — r2/ = ^ ~ (r — k)7j. Hence each root of f{x/y, 1 ) 
= 0 is diminished by h Thus the difference of any two roots is 
unaltered. 
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Since = 0 in (15) if we take k — — ai/ao, we see that the 
resulting transformation replaces / by the reduced form 

^ = ao p + (2)62 ri ^ + (3)63 + . . . , 

where 

52 = 02 — aiV^^Q, bz = az — 3ai 02^0 + 2aiV^^o®* 

The roots of gr = 0 are 

Ti + ai/ao = [(r^ - n) + h (r^ - r^)]/v 

(i = 1, . . . , p). 

Hence each root of ^ = 0 is a linear fimction of the differences of 
the roots of / = 0 and thus is unaltered by every transformation 
Tk- The latter is true also of h^/aQ, bz/ao, . . . , which are equal to 
numerical multiples of the elementary symmetric functions of the 
roots of gr = 0. Hence the homogeneous isobaric polynomials 

S2 — do bz ^ cio Ci2 — 

(16) - Sz = ao^ bz = ao^ az — 3ao ai 02 + 2ai®, 

= Uo® 64 = do^ dA — 4 ao^ di dz + 6ao di^ (h — 3 ai‘*, 

etc., to Sp are invariants of / with respect to all transformations 
T hi and hence are seminvariants of /. 

Since g was derived from / by a linear transformation of deter- 
minant unity, every semin variant S of f has the property 

S(dQj . . . , dp^ ~ S (dof 0, b2j * * • , 5p) 

= S(ao, 0, &/ao, . . . ,'/Sp/ao^”0* 

Theorem 11. Every seminvdrmnt of f is the quotient of d 'poly- 
nomial in the seminvdriants Uo, S 2 , ••• f Sp by a power of do. 

We shall presently apply this theorem to obtain all semin- 
variants when p < 5. Among them occur the leaders of all 
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covariants by Theorem 10. Our next problem will then be to find 
all covariants K having a given leader S. As an aid to its solution, 
we now prove 

Theoeem 12. If S is a seminvariant of degree d and weight w of 
a binary form f of order p, any covariant K of f having the leader S is 
of known order n pd — 2w, and at most one such K exists J- 

By Theorem 9, the weight of /S is the index I of K. By Theorem 
7 with r = 1 , ^ = 2 , we have pd — n — 21. Hence the order n of 
K is uniquely determined by S. 

If there exist two distinct co variants S + • • • of /, they have 
the same index I — w, whence their difference is a covariant of / 
having the factor y. It is of the form yq{aQ, . . . , a;, y). Let 

7 } be any chosen linear function of x and y and select another such 
function ^ which is independent of ??. In view of the transforma- 
tion of determinant A which expresses x and y linearly in terms of 
^ and y, we have 

■nq(Ao, . . . , Ap; ri) = A* yq{ao, , a^; x, y). 

Hence yq is of order n and has as a factor every 77 , This contra- 
diction completes the proof of Theorem 12. 

CoROLLAEY. The weight of an invariant of degree d of a binary 
form of order p is ^pd. 

11. Theorem 13. Every binary linear transformation is a product 
of transformations of the three types 


Tr.: 

X == ^ + nri, 

y = V, 

Sm. 

X = 

y = my (m 9^ 0); 

V: 

X = - y, 

2/ = |. 


From these we obtain F® == x = rj, y = — and 

^That one such K always exists is proved in § 16 . 
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Rn = T-. 7: a: - X, y^Y + nX; 

Pm = 7“' Sm 7: x = mX, y = Y (m 5 ^ 0). 

Then x = + It], y = + drj, D — ad — be 9 ^ 0, is equal to 

Sd P-D/d Phd/'oRc/d {d 9^ 0), Sc P^h T-a/b 7 (d = 0), 

Applying the corollary to Theorem 9, and noting the effect of 
transformation 7 of determinant unity, we obtain the 

Corollary. A pol 3 niomial in ao, . . . , x, y which is homo- 
geneous in X and y is a covariant of the p-ic (11) if and only if 
it is isobaric, is covariant with respect to every Tn, and is un- 
altered when X is replaced by p, p by — x^ and by (— l)^‘aj,_,- 
for z = 0, 1, . . . , p. Hence a seminvariant which is unaltered by 
the last replacements is an invariant. 

12. Fundamental system of covariants of the binary p-ic for 
p < 5. We shall exhibit a finite number of polynomial covariants 
Kij , . . , Kn of the binary p-ic / such that every polynomial 
covariant of / is expressible as a polynomial in iTi, . . . , Nn with 
numerical coefficients. Then Ki, , , , ^Kn are said to form b, fun- 
damental system of covariants of /. 

We shall first exhibit certain seminvariants of / such that every 
seminvariant is a polynomial in them. Each of them wiR be seen 
to be the leader of a known covariant of /. Our problem is thereby 
solved, since no seminvariant is the leader of two covariants by 
Theorem 12. 

For p = 1, Theorem 11 shows that every seminvariant of 
/ = ao a; + ni y is a polynomial in ao* Since ao is the leader of the 
covariant / of /, we have 

Theorem 14. Every polynomial covariant of f = ao x + aiy is 
a polynomial inf. Thus f itself constitutes the fundamental system of 
covariants of /. 
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For p = 2, Theorem 11 shows that every seminvariant is the 
quotient of a polynomial P(ao, S2) by ao\ But S2 is not divisible 
by Go. Hence P is divisible by ao only when all of its terms have 
the explicit factor ao. Cancelling the factor ao from both P and 
Uo®, we reduce the problem to a like one having c — 1 in place of 
e, and finally to the case 6 = 0. Hence every seminvariant is a 
polynomial in ao and Since ao is the leader of the covariant 
f of /, while S2 is an invariant of / by § 4 , we have 

Theorem: 15 . Every polynomial covariant of the binary quadratic 
form f is a polynomial in f and its discriminant 82^ 

A different treatment is necessary when p > 2 since there are 
semin variants which are not polynomials in ao and the Si. Write 
Si for the terms of Si free of ao,* by ( 16 ) 

( 17 ) S2 = — ai^, S3 = 2 ai®, S4 == — Sax'*. 

For p = 3 , 4S2^ + ^ 0, We find that 

( 18 ) m + Sz^^ao^D, 
where, in accord with (10) and (12), 

( 19 ) D = — 6ao ai 02 03 + 4 ao 02® + 4 ai^ az — Sai^ 02^ 

is the discriminant of the cubic form/. By means of ( 18 ) we may 
eliminate Sz^ and higher powers of Sz from any polynomial in 
cto, Sz and conclude from Theorem 11 that every seminvariant 
is of the form Pfao^^ where P is a polynomial in ao, Sz, D of 
degree 0 or 1 in /Sa. If e = 0, P is a seminvariant. Henceforth 
let 6 2 1 . In case every term of P has the explicit factor ao, we 
cancel it from P and ao® and hence reduce the problem to a Hke 
one having 6 — 1 in place of e. It remains only to treat the case in 
which not every term of P has the explicit factor ao. Then the 
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expression for P in terms of ao, ai, 02, az is not divisible by ao 
(i.e., P does not have the implicit factor ao). For, if so, then. 

P' = P(0, S2, S3, d) = 0, d = 4ad az — Sai^ 02^. 

Since az occurs in d, but in neither S2 nor S3, we conclude that P' 
is free of d. The first power of S3 == 2 ai^ is not cancelled by a poly- 
nomial in S2 = — Hence P' is free also of S3 and therefore 
of S2. 

Hence every seminvariant of the cubic form / is a polynomial in 
ao, S2, Sz, D, They are connected by the syzygy (18), i.e., a relar- 
tion not serving to express any one of the four as a polynomial in 
the remaining three. 

Theorem 16. A fundamental system of polynomial covariants 
of the binary cubic form f is given by f, its discriminant D, its Hessian 
36JT, and the Jacobian ZG of f and H. They are connected by the 
syzygy 

( 20 ) + 

This follows since ao, S2, Sz are leaders of the covariants/, H, (?. 

For p = 4, we see from (17) that the simplest relation is 
SSs^ + S4 = 0. We find that 

S4 -f” ZS^ = a^ J, J = ao a4 — 4 ai az + 

We employ this identity to eliminate S4. Consider polynomials 
P(ao, S^j Sz, I) having ao as an imphcit, but not explicit, factor. 
Such a polynomial is given by (18). For ao = 0, D = — ai^I 
= S2 /; we find that 


S2 1 — D = ao J, 

J sz ao 02 ai — ao az^ + 2ai 02 a% — ai^ — 02*. 
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Elimination of D between this identity and (18) gives 
(21) ao^ J -ao^S2 1 + 4 ^ 2 ^ + >83^ - 0. 

The latter enables us to eliminate and higher powers of Ss 
from P to obtain a polynomial Q in ao, Sz, I, J of degree 0 or 1 
in /S3. If possible, let Q have ao as an implicit, but not explicit, 
factor. Then 


Q(0, — ai^y 2ai^, — 4ai as, /) = 0. 

In view of the term — ai^ a4 of J, Q cannot involve J, and similarly 
not I. Nor can Q be of degree 1 in /S3 in view of the odd power ai^. 
Hence Q is free also of S3 and therefore of S2. 

Hence every seminvariant of the quartic / is a polynomial in 
ao, S2, S3, J, /. They are connected by the syzygy (21). 

Denote the Hessian of / by 1445', and the Jacobian of / and H 
by 8G, Then ao, S2, and S3 are the leaders of the covariants /, H 
and G of /. By the corollary in §11, the seminvariants I and J are 
invariants of /, since they are unaltered when we replace ao by a4, 
a4 by ao, ai by — a3, and as by — ai. 

Theoeem 17. A fundamental system of polynomial covariants 
of the binary quartic form f is given by /, ^ts invariants I and J, its 
Hessian 1445, and the Jacobian 8G of f and 5. They are connected 
by the syzygy 

( 22 ) pj + + = 

13. Canonical form of a binary quartic form f; solution of 
quartic equations. We shall exclude the uninteresting case in 
which f{x/y,l) = 0 has a multiple root. Since there are four 
distinct roots, / (x, y) is a product of four linear forms, no two of 
which have proportional coeiOdcients. Taking any two of them as 
new variables J and 77, we may reduce / by a linear transformation 
to the form where 
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Then gk 0 since neither | nor ^ is a factor of q. Choose r, s, t 
so that 

= k/g, r + s = g, r — s = h/t. 

Then 


^v^it-^ia + trjy- i^-tvn 

q = ri^ + + si^ - t7i)\ 

Introducing the new variables 

we see that / = takes the canonical form 
(23) X4+ Y^ + ^mX^YK 

The presence of the arbitrary constant m is explained by the 
existence of a rational absolute invariant P/J^, i.e., one of index 
zero, which therefore has the same value for / as for any form, like 
(23), derived from / by linear transformation. For, the invariants 
I and J are of weights (and hence indices) 4 and 6, so that P/P is 
of index zero. 

In the preceding proof of the existence of a canonical form (23), 
we made use of the roots of / ix/y, 1) = 0. We shall now show how 
to obtain the canonical form without assuming that the roots are 
known. We have proved the existence of a number m and a linear 
transformation T of determinant A 9 ^ 0 which expresses X and Y 
linearly in terms of x and y, such that (23) becomes the given 
quartic / under this transformation T, To compute the values of 
I and J for (23), take == 1, = 0, 02 = m, az = 0, ^4 = 1; 
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then I and J have the values 1 + and m — respectively. 
Since I and J have the indices 4 and 6, we get 

f = X^+Y^ + 6m 
I = A^(l + 3m^), J = A^(m — m^), 

where I and J refer to /. The Hessian of (23) is 

Z2 + mF2 2m XY 

^ 2mXY Y^ + mZ2 * 

Hence if the Hessian of / is denoted by 144ff , a co variant of index 
2, we have 

H = A2{m(Z4 + F4) + (1 ■“ 3m2)Z2 y^}. 

We first compute a root A^ m of the cubic equation 

(24) 4(A^ my — I (A^m) + J = 0. 

Then A^ = J — 3(A^ my. Choose either square root as A^. From 
the value of A^ m, we get the value of m. Eliminating X^ + F^ 
between the expressions for / and H, we get 

A2 m/ - H = A2(9m2 - 1)X^ Y\ 

This gives XF by extracting the square root of a form which is a 
perfect square. For, if 9m^ = 1,/ would be the square of X^ + F^, 
contrary to the hypothesis that there is no multiple root. From 
XF and/, we get X^ + F^ and then X + F and hence X and F. 

Theokem 18. Every binary quartic form f{x, y) not divisible by 
the square of a linear form can be reduced to a canonical form (23). 
We can find m and the linear functions X and Y of x and y by finding 
a root r of a cubic equation (24), whose coefficients are the invariants 
I and J of /, and by extracting the square root of rf — H, where 144jff 
is the Hessian of /, 
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We can express (23) as a product of linear factors by solving 

+ 1 + Qmz = 0, where z = 

Theorem 19. To solve any quartic equation q(w) = 0 not having 
a multiple root, reduce the quartic form fix^y) = y^ Q.{^/y) io its 
canonical form (23 ) and express it as a product of linear factors by 
employing square roots of known numbers. To solve the auxiliary 
cubic equation (24), reduce the corresponding cubic form to one 
involving only the cubes of the linear factors of the Hessian of that 
cubic form ( §7). Hence we can solve any quartic equation by extract- 
ing a cube root and certain square roots of numbers determined by the 
invariants I and J and the Hessian of the corresponding quartic 
form f. 

For tables of invariants and covariants, see Fak di Bruno, 
Theorie der Binaren Formen, Leipzig, 1881. 

Exercises 

1. If J = 0, A 7 ^ 0, then Zm^ = — 1. Show that I = 0 for the quartic 

— iTw — = 0 (which arises in the theory of inflexion points 

on a plane cubic curve). Find its quadratic factors. 

2. Find the three canonical quartics when / = 0. Apply to solve cw^ 
± cw — 1 =0. 



Chapter II 

FURTHER THEORY OF COVARIANTS OF BINARY FORMS 

We shall here prove some fundamental theorems on invariants 
and covariants which were merely verified in the introductory 
chapter for binary forms of orders S 4. For example, we shall 
prove that all covariants are expressible as polynomials in a finite 
number of covariants. Suggestions for supplementary reading are 
made at the end of the chapter. 

14. Annihilators of covariants. Consider a binary form 

( 1 ) f =T,Q)o'iXP-^y\ 

i*=Q 

having prefixed binomial coeflicients, as in §9. Under the trans- 
formation 

Tnl X== ^ + 717), 2 / = Vf 

let / become 

(2) 

i=0 

where the A t are polynomials in n, Oo, . . . , whose actual ex- 
pressions will not be needed. Differentiating (2) with respect to 
n, we get 

0 = S © I*”-* r - Ai(p - 

since v = free of n, while ^ = x — nt). The total coefficient of 
p-’ )?’ is 


24 
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in which the second term is to be suppressed when j = 0. Since 


(?) = G^) 


p \ip -j + 1 ) 


we evidently get 


= Mj-i (i = i, 


Consider the polynomials 

k = K (ucj • • • ) Oipj Xj y')j K = Fl(Ao) • • • j Apj v')* 


By (3), 


_ 'f -J 

dn ^dA,dn a| an '"'aA, a|’ 

The polynomial is said to be a covariant of (1) with respect 
to every transformation Tn (of determinant unity) if and only if 
K = k becomes an identity in n, x, 2/, Uo, . . . , a^, after ^ is replaced 
by a: — ny, rj by y, and each A i is replaced by its expression in 
terms of Uj ao, . . . , ap. Since k is free of n, K = k implies that 


Conversely, the latter implies that K is independent of n and hence 
is equal to the value k to which it reduces when n = 0, since 
Tn is then the identity transformation x — y = 7 }, which re- 
places (1) by a form (2) with the same coefl&cients Ai = ai. The 
condition is therefore that the final expression in (4) be zero 
identically in n, x, y, Uo, . . . , Cp. In particular, it must be zero 
when n = Oj whence 


Qk y 


dk 

dx 


-0, 


V 


3“1 


d 

daf 


( 5 ) 
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identically in x, y, Oo, . . . , Op. From this identity we evidently 
obtain an identity when we replace x, . . . , cip hy any quantities 
such as I, 7 ], Aq, , Ap. Thus the final expression (4) is zero 
identically in the latter quantities. Hence A: is a covariant of (1 ) 
with respect to every transformation Tn if and only if (5) holds. 
We shall then say that fc is annihilated by the differential operator 

( 6 ) = + 

The form (1 ) is unaltered if we interchange x and y, and inter- 
change Ci and ap^i for i = 0,1, ... ,p. Hence fc is a covariant of 
(1) with respect to every transformation 

Rn'. X = y = 7) A wf, 

if and only if fc is annihilated by the operator 




d , 


•+• (p - t)a.-+i — + 1- 

dct% 


— )• 

dCLp—i^ 


By the corollary in §9, i is a covariant of (1) with respect to 
every transformation Sm: x = y = m??, if and only if fc is 
isobaric. Since T_i Ri r_i = V, viz., x = — 7, 2 / = X, it follows 
from §11 that every binary transformation is generated by the 
Tny Rn} Sm* 


Theokem 1. A polynomial K(aQ, , , , ^ ap; Xj y) is a covariant 
of the binary form (1 ) if and only if it is isobaric and is annihilated^ 
by the two operators (6) and (7). 

CoROLiiARY 1. A polynomial in Oo, . . . , otp is an invariant of 
(l) if and only if it is isobaric and is annihilated by 0 and 0. 


^In computations, it is simpler to employ the single annihilator (6). We apply 
the corollary in §11 with condition concerning replaced by the equivalent con- 
dition that the polynomial be annihilated by (6), 
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For example, the weight of an invariant I of degree 2 oi OqX^ 
+ 2ai xy + 02 is 2 by the corollary at the end of §10. Hence 
I = roo 02 + sa^. Then 0,1 = 2(r + s)ao Oi = 0, OJ = 2(r+ s)- 
0 x 02 ^ 0, Either condition gives 5 = ~ r, I = r(oo 02 — Oi^). 

By the definition of seminvariant in §10, we have 

CoEOLLAKY 2. A homogeneous, isobaric polynomial in Oo, . . . , 
Op is a seminvariant of (1) if and only if it is annihilated by 0. 


15. Commutators. Express each of the annihilators 0 and 0 
of invariants in the following two forms: 


P Q 3>-l Q 

^ = HjO'i-i = ^(^ + 1) 


J=1 


da 


3 ft=0 


dojc+i 


0 =Z(P - j 4- l)a, =E(P - k) ak+i ~ 

da/_l dOjfc 


E 

7«1 


Then 


flO = Ei^j-if (p - J + 1) 1-S(p - k)ak+i ^ - 1 

dOj^x ;t=o do,* dOfc-J 


OQ =^(p - fc)afc+ir(ft + 1)— ^ l-Ejaj-i ^ 1- 

*- dOjcJr^ uOa dOj-J 


The terms involving second derivatives are identical. Thus 
00 — 00 involves only first derivatives and is called the com- 
mutator (alternant) of 12 with 0. In the first terms of 120, write 
j = i + in. those of 012, write = ^ — 1. Hence 

QO — Oa='^{i + l)a,(p - i )~ — E(p “ * + 

i_0 i-1 


p 

= E(p ~ 2i)ai 

i»*0 


do* 
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If S is a homogeneous function of ao, . . . , ap of (total) degree d 
and hence is a sum of terms of type 

t ^ cao®o • • * Up®?’ (co + + * • • + Cp = d), 

we readily verify Euler's theorem (cf. Ex. 3, §5): 

« ==n dCii 


US is isobaric, it is a sum of terms t in which 


Cl + 2 c 2 + -• + pep - w = constant. 

Then 

P P P 

^ iai — ^ ^ t^wt, X) iai — ^ wS. 

TlJ da,- ^ dUi 

Theobem 2. 1/ S is both homogeneous (of degree d) and isobaric 
(of weight w) in ^ Up, then 

(8) (00 — OQ.)S ^ nS, n ^ pd — 2w. 

This result is the case r == 1 of 


(9) (00" - 0" 0)/S ^r(n-r + l)0"--i S. 

To prove the latter by induction on r, assume that it holds when 
r = k, and note that OS is of degree d and weight + 1, so that 
when (9) is employed with S replaced by OSj n must be replaced 
by^ — 2. Hence 

(00^+1 ~ O^-^i 0)>S = (00^ - 0^ 0)05 + 0^(00 - 00)5 
^k(n-2-k + l)0^-i OS + nO^ S 
= (k + l)(n - k)O^S, 

and (9) holds for r = + 1. The induction is therefore complete. 
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Similarly, since QS is of degree d and weight w — 1, 

(10) (00^ - 0)S = r(- 71 - r + S. 

16. Existence of a covariant with a given seminvariant leader. 
In §12 we verified for p ^ 4 that every seminvariant of the binary 
form / of order p is the leader of a covariant of /. We shall now 
prove that a like result holds for any p. 

Lemma. If S is a seminvariant, not identically zero, of degree d 
and weight w of a binary p-ic, then pd — 2w ^ 0. 

Suppose on the contrary that >S is a seminvariant for which 
n = pd -- 2w iQ negative. Since QS ^0, (9) gives 

(11) S ==r(n-r+ 1)0^-^ S (r = 1, 2, ) 

and no one of the coefficients on the right is zero. We recall that 
the weight of OP exceeds that of P by unity. The maximum 
weight of a pol 3 momial of degree d in ao, . . . , is pd, which is 
the weight of Since S is of degree d and weight pd 

+ 1, it is zero identically. Thus (11) for r — pd — w + 1 gives 
Qpd-w g ^ 0. Then (11) for r = pd — tu gives S = 0. 

Proceeding similarly, we get OS ^ 0 and then S = 0, contrary to 
hypothesis. 

By Theorem 12 of §10, any covariant with the leader S is of 
order n ■= pd — 2w, Give it the notation 

K = Sx^ + Si y + S 2 x‘^^^ + - ■ + Sny""^ 

The result of applying the operator (7) to K is 

(OS - Si)x- + {OSi - 2S2)x-^y 

+ h (OSn-i - nSrf)xy^-^ + 0& y% 

which is zero identically in x and y if and only if 

(12) Sx^ + OSx^’-^ y + ^O^ Sx^-^ + h 0- Sy% 

n . 
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and 0^+^ iS ^ 0- To show that the last condition is satisfied, note 
that (11) for r = n + 1 gives 00^+^ /S = 0. Since S is of 
degree d and weight W — w + n+ l = pd — w + 1 and is 
annihilated by 0, it is a seminvariant by corollary 2 of §14. It is 
zero identically by the lemma since pd — 2Tr == — (pd — 2w) 
— 2 is negative. Hence (12) is annihilated by the operator (7). 

In the result of applying operator (6) to (12), the coefficient 
of is 

I J20' S - ■ \: - v (n - r + l)0-i S, 

r! (r ~ 1)! 

which is identically zero by (11). Hence (12) is a covariant by 
Theorem 1. 

Theorem 3. There exists a unique covariant of a binary p^-ic 
whose leader is any given seminvariant of the p-ic. 

17. Hilbert’s theorem 4. Any set S of forms in Xi, . . . , Xn con-- 
tains a finite number of forms Fi, . . . , F* such that every form of the ' 
set can be expressed as a linear combination of Fi, . . . , F& with 
coefficients which are forms in xi, . . . , Xnj but are not necessarily in 
the set S. 

For n = 1, S is composed of certain forms Ci x^i, c% x% .... 
Let Cm be one of the minimum exponents and take Fi — Cm, x^^n. 
Every form of the set S is the product of Fi by a factor of type kx^, 
where e ^ 0. Hence the theorem holds with & = 1 when n == 1. 

To proceed by induction on n, let the theorem hold for every set 
of forms in — 1 variables. To prove it for the set /S, we may 
assume that S contains a form Fq of (total) order r in which the 
coefficient of Xn"' is not zero. For, let/(a;i, . . . , x^) be a form in 8 
which actually involves Xn- Then there exist numbers Ci, . . . , 

(cn 9^ 0) for which c ==/(ci, Cn) is not zero. The trans- 

formation 


Xi ^ yt + CiPn (i = 1, . • . , n ~ 1), 


Xn — Cfi Pn 
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is of determinant 5 =^ 0 and replaces / by a form i^o( 2 /ij * ^ * ,yn) 
in which the coefficient of yn'" is evidently c. 

Let F be any form of the set S, Dividing F by Fo, we get 
i?’ = Eq P -f JSj where P is a form whose order in is < r. In 
P we segregate the terms whose order in Xn is exactly r — 1 and 
have 

F S Po -P + + N, 

where ikT is a form in 0 : 1 , while iV* is a form mxi, , , . ,Xn 
whose order in a;,! is ^ r — 2 . Each F uniquely determines an M, 
Since the theorem is true by hypothesis for the set of forms M, 
there exist a finite number Mi, Mi of them (corresponding 
to certain forms Fi, Fi of S), such that every M can be ex- 
pressed as 

ikf=/iMi+ 

where the fi are forms in , Xn-h Then 

F^F,P + N + Xn'-ifi Mi, Fi ^FoPi-\- Mi x„r~i + Ni, 

whence 

F^FoP' +Ef^P‘ + R', P'=P-Ef,Pi, 

R' = 

Each exponent of Xn in P' is ^ r — 2; we segregate its terms for 
which that exponent is exactly r — 2 and have 

F = FoP' +j2f.Fi + M' 


where Jkf ' is a form in , Xn~i, and N' is a form inxi, . , , ,Xn. 

whose order in is ^ r ~ 3. 

Since the theorem holds for the set of forms M\ each is a linear 
combination of M \, . . . , M'm, corresponding to certain forms 
Fi+iy. , FiJ^^ of S, As before, F differs from a linear combinar 
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tion of Fo, . . . , Fz+^ by Af" where Af'' is a form in 

oji, . . , , Xn^ij and iV" is a form whose order in is Sr — 4. 
Proceeding similarly, we see that F differs from a linear combina- 
tion of certain forms Fo, . . . , Fj of >S by a form lacking Xn- One 
more step leads to the theorem. 

18. Finiteness of a fundamental system of covariants. For 
simplicity, we first consider only invariants of a single binary 
form / with the coefficients oo, , . , , ap. By Theorem 8 of §8, 
every invariant of / is homogeneous in the a^s. By the preceding 
theorem, there is a finite number of these invariants /i, . . . , 
in terms of which every invariant 7 of F is expressible linearly: 

(13) 7 = Fi 7i + . . . -f. Em 

Let 7 and 7 j be of degrees d and d,, respectively. By the corollary 
at the end of §10, their weights are and |pdj. Hence Ej is 
of degree Dj — d dj and weight ^pDj, 

We shall next obtain a differential operator D such that DEj 
is an invariant. Apply the operator to (10) ; we get 


Qr-l Or-2 ^2^ QaS = r (~ 71- r + l)0"-2 Qr~l 


In the second term replace QOS by its value OQS + nS from 
(8). Thus 


0^-^ Q^S ^ 0^-^ Q^-^[OQ - (r - 1) (n + r)]S. 

Since OQS has the same degree and weight as Sj we may employ 
the formula derived from this by replacing r by r — 1 to eliminate 
Or -2 repetitions of this process evidently obtain 

0^-^Q^S^ 0[00- (?^ + 2)][00-2(n + 3)] 

• • -[OQ - (r - 1) (n + r)]>S, 

which we may also establish by induction. Let S be of degree d 
and weight w == ^pd, so that its n in (8) is zero. Take r = tt; -h 1^ 
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apply 0^+^ S = 0, and divide by (— 2) (— 2 • 3) . . . ; we get 
0 = QDS, where 


(. 4 ) = 


oa 

2 - 3-1 


l-.-Ti- 


00 


w(w + 1)- 


Since DS is annihilated by 2 and is of constant degree d and 
constant weight w = ^pd, it is a seminvariant by Corollary 2 of 
§14. Being the leader of a covariant of order n = 0, it is an 
invariant. Since the invariants I and I ^ are annihilated by both 
0 and 0, they are unaltered by the operator D, These results 
hold also if we annex factors ^ = 1 — cOO at the right of (14), 
since gS has the same degree and weight as S. Hence if we take 
w to be the maximum of the weights of the and operate on 
(13) by Dj we get 


I ~ + • • • + Jmlm, Jj = DEj- = invariant. 


Since each is of the form (13), 

m m 

d" ? ~ ^ jk I k) I ^ ^ ^ j jfc I j Ik* 

A»=l j, A“=l 

Appl 3 dng to the last an operator D with w sufficiently large, we get 

I = 2-) E jh I f I kj 

where the Ljh are invariants of /, Since there is a reduction of 
degree at each step of the process, we ultimately obtain an expres- 
sion for / as a polynomial in Ji, . . . , Im with numerical coefficients. 
Hence h, ... ,Im form a fundamental system of invariants of/. 


Exercise 

Extend the finiteness proof to invariants of a system of binary forms 
oo + . . . , hox ^2 + , . . j cqx^s + , ... First prove as in §14 that 

an invariant is annihilated by 
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and by S 0. Next, if S is a polynomial in the o., 6„ . . . which is of constant 
degree di in the a’a, of constant degree dj in the 6’s, . . . , and of total weight ^D 
in the a’s, 6’s, . . . taken collectively, show that (8) and (10) hold if we replace 
O by 2 0, Q’' by (2 Q)', and n by 2 p, d^ - 2w. Make the like replace- 
ments in (14). 

The finiteness proof is extended to covariants by means of the 

Lemma. The set of all hcymogeneous covariants of the binary 
forms fi,...,fh is identical with the set of forms derived from the 
invariants I (homogeneous in X, Y) of fi, ... ,fh and L = Yx 
— Xy by replacing X by x and Y by y in each I. 

To simplify the notations, let a,b, . . . denote the coefficients of 
fi, ... ,fk arranged in any chosen order. Let A, B, .. . denote the 
corresponding coefficients of the forms obtained by the transfor- 
mation 

X = pu + qv, y — ru + sv, D = ps — qr 7 ^ 0. 

This replaces L by Fm — Uv, in which 

U = sX-qY, F = - rZ -f pY. 

Solving these two equations, we get 

DX ==pU + qV, DY = rU + sV. 

Let I = I (a, b, ...; X, Y) he of index 1. Then 
I(A, B,...;V,V) = D^I = Z)*-"7(a, b,...,DX, DY), 
since / is of constant order nmX,Y. Hence 
I{A,B,...-,U,V) 

= Di-n J(a, 6, . . . ; pU -}- qV, rU + sF), 
identically in U, V. We may replace the latter by u, v and get 
I{A,B,...-,u,v) = I(a,b,...;x, y). 
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This proves that J(a, 6, . . . ; x, ?/) is a covariant of fi, ... jfk of 
order n and index I — n. The argument may be reversed. 

Since every covariant is a sum of covariants each homogeneous 
in the variables ( §8), we have completed the proof of 

Theoeem 5. There exists a fundamental system of covariants of 
every set of binary forms. 

The first proof of this classic theorem was made by Gordan by 
use of the symbolic notation. Cayley had earher come to the con- 
trary conclusion that the fundamental system for a binary quintic 
is infinite, due to an error relating to the independence of the 
syzygies between the covariants. For a binary form of order <5, 
a finite fundamental system of covariants was constructed in §12. 

19. Piniteness of syzygies. Let Zi, . . . , be a fundamental 
system of invariants of the binary forms /i, . . . , /r. Let 
,Zm) be a polynomial with numerical coefiGicients such 
that S (Ii, Im) is zero identically in the coefiGicients of the fs. 
Then /S(Ji, . . . , Zm) = 0 is a syzygy between the invariants (see 
the examples in §12). 

By means of a new variable Zm+i, we may convert S(zi, . . , 
into a homogeneous form H by multiplying a suitable power of 
Zm+i into aU terms not of the maximum degree. By Hilbert’s 
theorem, the forms H are all expressible linearly in terms of a 
finite number Hi , . . . , ZT* of them. Take Zm+i = 1* Thus 

(15) S^PiSi+ + 

where Pi, , • . , P* are polynomials in zi, . . . , In this identity 
in zi, . . . , Zm, we may take zi == Zi, . . . , Zm = Zm. 

Theoeem 6. There is a finite number of syzygies Si == 0, . . . , 
S& ~ 0, such that every syzygy S = 0 implies a relation (15) in 
which Pi, ... ,Ph ore now invariants. 

In particular, every syzygy is a consequence of Si = 0, ... , 
Sfc = 0. By the Lemma in §18, the proof holds also for syzygies 
between covariants. 
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20. Canonical form of a binary form of odd order. In §7 we 
saw that any binary cubic whose discriminant is not zero is the sum 
of the cubes of two linear forms which are the factors of the Hes- 
sian. We shall prove an analogous theorem for a form/ of any odd 
order 2n - 1. Let / have the notation (1) with p = 2n - 1. 
We seek constants p,, such that 

(16) / ^ Pi (x + ri + ['pnix + Tn 

identically in x, y. The conditions are evidently 

(17) == Pi ri^‘ + • • • + pn rj (i = 0, 1, . . . , 2n — 1). 
We can determine solutions • * * ? of the n linear equations 

( 18 ) Us + U5+1 qn-i + ■ * * + cts+n-i qi + (^s+n = 0 

(s = 0, 1, . . . , - 1) 

if the determinant of their coefficients is not zero. Let ri, , . , , 
be the roots of 

(19) s" + qiZ^-^^+ h qn-^i z + qn = 0. 

If n, . , . j Tn are distinct, conditions (17 ) with i = 0, 1, . . . , n — 1 
uniquely determine pi, . . . , Pn- We shall prove that the remaining 
n conditions are then satisfied- To proceed by induction on s, we 
assume that the conditions (17) hold when i < s + n and prove 
that the condition with f = s + n is then satisfied. Multiply the 
condition with f = s by qny that with z = s 4* 1 by qn-ij • - . , that 
with z = 5 + n — 1 by gi, that with z* = s + n by unity, and add. 
The coefficient of p* is 

r** g« + gn-i + * * • + gi + 

which is the product of r** by the zero value of the polynomial 
(19) for z = ffc. The new left member is evidently (18). 

The equation having the roots n, . . . , is evidently obtained 
by eliminating gi, . . . , g« from (18) and (19). We prefer the 
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homogeneous equation in x and y whose linear factors are a; + 

(i = 1, . . . , n). It is found by taking the determinant of the 
coefficients of • • • , g,u 1 iii 

/-.Q/x 2/” qn^i + 2/«-2 

^ ^ — h (— yqi+ (— xY = 0 

and (18), and hence is 


yn 

— xy^~^ 


... (_ y 

(- »)• 

Oq 

ai 

02 

' * * On—l 

On 

ai 

02 

Oz 

On 


O'n—l 

On 

^n+1 

02n-2 

Cf2n~l 


This determinant is called the canonizant C of /. 


Theorem 7. Every sufficiently general binary form of odd order 
2n — 1 is a sum of (2n — l)th powers of n linear forms. 

Exercises 

1. For n = Zj the coefficient of in the canonizant C of the quintic / is the 
invariant J of the quartic q = gqX^ + 4taix^y , In particular, / is a 
seminvariant of q and hence is annihilated by the operator n in (6) with 
p = 4. Since J lacks it is therefore annihilated by Q with p = 5 and hence 
is a seminvariant of /. 

2. Show that J is a seminvariant of every / with p ^ 4 since it is a rational 
function of the seminvariants ao, &, Si. See (21) of §12, and §10. 

3. There exists a covariant of order 3 of the quintic / with the leader J. It 
coincides with C since the canonizant of the form derived from / by any linear 
transformation T has as linear factors the functions derived from the linear 
factors of C by T. 

4. For n = 3, the canonizant C of / is the minor of in A: 


yS 

0 

0 

0 


1 a; 0 0 

do 

do X + ai 2/ 

dix + 02 y 

chx +azy 

, o = 

0 y X 0 

ai 

dix -jr <hy 

ar 2 X + 03 y 

dsx +aiy 

0 0 y X 

di 

02 X + dsV 

dzx + diy 

diX +05 y 


0 0 0 p 


For, the product of C by D = y% rows by columns, is A. 
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5. If a senunvjiriant S of the quintic f lacks 05, it is a seminvariant of the 
quaitic q. If the wei^t of S is double its degree, /S is an iuTariant of q. 

6. Fiad the determinant whose vanishing is a necessary and sufficient con- 
dition that a binary form / of order 2n be a sum of the 2»th powers of n linear 
forms. 

Hint: Employ (18) also for s = n. 

21. Further topics on covariants. Reference will be made to 
elementary introductions in English to the following subjects: 
Seminvariants and covariants as functions of the roots.^ 
Symbolic notation; geometrical applications.^ 

Canonical form of the binary sextic and octavic.® 

Functionally complete system of covariants of a binary form, 
found from a complete system of three linear partial differential 
equations.* 

Theory of covariants under linear transformations with integral 
coefficients taken modulo p, a prime.® 

The coefficients of a unique canonical form are invariants.® 
Interesting applications of invariants of a quintic form are 
given in the elementary papers cited in §112. 


^Dickson, Algebraic Invariants, Wiley and Sons, 1914, pp. 53-58. 

® IHd.t pp. 63-97, The Algebra of Invariants by Grace and Young is very difficult 
for students, except possibly the chapters on geometry. To readers familiar with 
German are recommended Gordan, Vorlesungen iiber Invariantentheorie, 1887; 
Clebsch-Lindemann, Vorlesungen uber Geometric, 1, 1876; II, 1891; and the up- 
to-date book on Invariantentheorie by Weitzenbdck, Groningen, 1923. The spe- 
cialist will need Meyer's report on invariants in Jahresbericht der Deutschen Math.- 
Vereinigung, 1, 1890-2, 79-292. 

® Elliott, Algebra of Quantics, 1895, 294-9; ed. 2, 1913, 285-90. 

^Dickson, Annals of Math., 25, 1924, 369-76. We may start with (18),'p. 373, 
noting that «/, vf, and wf are Of^Qf, and (Q 0 — 0 Q)f of our text. 

® Dickson, On Invariants and the Theory of Numbers, the Madison Colloquium 
of the Amer. Math. Soc., 1914. Cf. Glenn, Treatise on the Theory of Invariants, 
1915, 175-208. Dickson, History of the Theory of Numbers, III, 1923, 293-301. 

®WilczyTiski, Proc. National Acad. Sciences, 4, 1918, 300-5. 



Chapter III 

, MATRICES, BILINEAR FORMS, LINEAR EQUATIONS 

Chapters III-VI, which are independent of I-II, give a new 
exposition of the subject usually called higher algebra. We first 
develop Cayley’s calculus of matrices^ and the essentially equiva- 
lent subject of bilinear forms. The main theorems on the solu- 
tion of systems of linear equations are not presupposed, but are 
deduced as corollaries. 


22. Linear forms, matrices, linear transformations. Any homo- 
geneous linear fxmction, such as 7x — is called a linear form. 
Consider m linear forms 

r = ail 2/i + 2/2 + axn, 2 /n? 


2/1 *4“ 2/2 “4" * * * “f" yn 

in the n variables yi, , yn. Write the coefficients of these suc- 
cessive linear forms in the successive rows of a table 


( 2 ) 


A = 


Uii ai2 * • • ain 


ami am2 * * * amn 


Such a rectangular table composed of mn numbers arranged in 
m rows with n numbers in each row is called a matrix with m rows 
and n columns, or briefly an m X ^ matrix. The mn numbers a^, 
are called the elements of the matrix. We shall speak of A as the 
matrix of (the coefficients of) the linear forms (1). 

Examples of one-rowed matrices are the notations (x, y) and 
(Xj y,z) for points in a plane and in space, the elements of these 
matrices being the coordinates of the points. 

39 
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Matrix (2) is said to be equal to the m X n matrix 

[dll di2 • • * din ] 


[dml dm2 * * * d mn ] 

if and only if corresponding elements are equal: 

dll ~ dll, ^12 ” di2, • • • 3 0>inn “ dmn* 

We shall also write (1) and (2) in the compact notations 

n 

(1^) 1% = S ^iiVi (“i ~ 1, . # . , 

3=1 

(2') A = {an) (f = 1, . . . , m; y = 1, . , . , n), 

in which we exhibit the element an which lies in the fth row and the 
yth column of the matrix (2). 

Suppose that the n variables are expressible linearly in terms 
of s new variables 2 i, . . . , 

8 

(3) Vi = 2 2* O’ = 1, • • . , n). 

*=1 

Inserting these values into (1')? we get 

n 8 8 

( 4 ) li ~ ^ l^jJc ^ ^ = !,•••, 7Tl)f 

3-1 &-l jfc-1 

where 

n 

(5) jpik ^ ^ ^ ^^3* i^3'fc (“i ~ 1, * • . , 7yi] Jc = 1, 

3-1 

Let B = (&/;{;) and P = (p»*) denote the matrices of the hnear 
forms (3) and (4) respectively: 



§ 22 ] 


LINEAR FORMS, MATRICES 


41 


* * • iu pii * ‘ * Via 

(6) B = , P = 

J^nl * * * ^ns [Pml * * * Pms 

We shall call P the product of A and P, taken in that order, and 
shall write P = AB, In view of (5), we therefore have the 
following 

Rule of multiplication of matrices: The element in the ^'th row 
and A;th column of the product of a matrix A with n columns by a 
matrix B with n rows is the sum of the products of the successive 
elements of the ith row of A by the corresponding elements of the 
fcth column of B, 

For example, 

/a b\ /a _ /aa + 67 a^ + bd\ 

\c d)\y d) \ca + ^7 Cj8 + dd) 

By a linear transformation we shall mean a system of n linear 
homogeneous equations 

n 

(7) (j=l, 

which express the variables yi, linearly in terms of the 

same number of variables zi, ^ Zn- This transformation is said 
to have the matrix (6jfc). Taking s = n in (3)- (6), we shall 
say that the transformation (7) replaces the system of linear forms 
(1) by the system of linear forms (4). This proves 

Theorem 1. A linear transformation with the matrix B replaces 
a system of linear forms with the matrix A by a system of linear forms 
with the matrix AB. 

When m = n, equations (!') define a linear transformation 

n 

li = fflij 2/j ~ 1) • • • j 


( 8 ) 
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which expresses the variables Zi, . . - , linearly in terms of the 
variables yi, . . . , yn^ We saw that the elimination of yi, . . . , 
between equations (!') and (3) gave (4) and (5), which, in our 
special case m ^ n = s, become 


S "" (iyk — 1, , n). 

The resulting linear transformation, which expresses h, ... ,1^ 
linearly in terms of si, . . . , Sn, and has the matrix AB, is called 
the product of the linear transformation (8) with the matrix A by 
the linear transformation (7) with the matrix B. 

The one-to-one correspondence between linear transformations 
and square matrices is therefore preserved under multiplication. 
For this reason a linear transformation is often denoted by its 
matrix when the particular choice of the letters used to denote the 
variables is immaterial. 

The determinant ] j j having the same 'n? elements as the matrix 
of a linear transformation (8) is called the determinant of the trans- 
formation. Since the above rule of multiplication of square ma- 
trices is one of the rules of multiplication of determinants, we con- 
clude that the determinant of the product of two linear transformations 
is equal to the product of their determinants. 

A linear transformation is called singular if its determinant is 
zero, otherwise non-singular. 

23. Inverse, identity, scalar, adjoint. Let a denote the deter- 
minant of transformation (8). If a 5 ^ 0, equations (8) have the 
unique solution 

(9) = 

where A^i denotes the cofactor of Oi,- in a. Equations (9) define 
a linear transformation which expresses the linearly in terms of 
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the Vs; it is called the inverse of transformation (8). Since it is 
immaterial what letters are used to denote variables, let us replace 
each by 21 *. Then the product of (8) by its inverse 

( 9 ') yi='f^ — Zk {j = 

h-l o, 

is evidently the identity transformation I defined by 

(10) li — Zij . . . , Zji = Zn- 

The inverse of (9) is evidently the initial transformation (8). 
Note that I = Si ii St denotes 

( 11 ) Ui = tVij , , . jUn ^ tVn- 

The product of (8) by in either order is 

n 

~ ^ ^ (^* “ Ij * • • J ^)) 

whose matrix will be denoted by either tA or AZ, being obtained 
by multiplying every element of A by t. The matrix of (11) will 
be denoted by St and called a scalar matrix; its diagonal elements 
are t and all remaining elements are zero. In particular, the matrix 
of (10) is denoted by I and is called the identity matrix. Thus 
Si = /. Our results give 

(12) StA = ASt - tA, IA=AI = A. 

Hence in products we may replace a scalar matrix factor St by the 
number t In particular, we may suppress a factor I. 

A matrix all of whose elements are zero is called a zero matrix 
and designated by 0. By (12) with Z = 0, we have OA = AO 
= 0 for every matrix A. 
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The matrix of (9') is designated by 


(13) 


A-i = 


An/ a • • • Anx/a 


Ain/a • • • Ann/a 


and called the inverse of A. By the above results, 


(14) = A-^A^L 


If we suppress the denominators a in (13), we get 


(15) 


Adj. A 


All * 

■■Anl 

Ai,t • 

'' Ann 


vhich is called the adjoint of matrix A. Hence the element A j^m 
jhe zth row and jth column of the adjoint of A is the cofactor of 
the element in the jth row and ith column of A. 

In case the determinant a of A is not zero, we have Adj. A 
= a A~^ by definition; then (14) implies 


(16) A (Adj. A) = aly (Adj, A)A == al. 


This result (16) holds true also when a = 0, as shown by con- 
sideration of continuity or by direct multiplication. 


24, Associative and distributive laws for matrices. Let A 

= (a^j), B = C == (c^,) be any m X n, n X s, s X t 

matrices, respectively. By (5), the element [ik] in the ith row 
and fcth column of the product AB is ^iaijb^k. By the same 
rule, the element [^7] of (AjB)C is 

(17) i2(iaiibjk)cH. 

& j =»1 

The element [jl] of BC is bjh Cki- Hence the element [il] of 
A(BC) is 
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j— 1 ^ k^l 

Since this is equal to the sum (17), we have 

Theorem 2. Multiplication of matrices is associative,^ i.e., 

(18) {AB)C ^ A{BC). 

Either of these products may therefore be denoted by ABC 
without ambiguity. It follows that ABC • D, AB • CD and 
A • BCD are equal and may be denoted by ABCD. Similarly 
for the product of any number of matrices taken in a prescribed 
order. The case of equal factors shows that is uniquely defined 
when the exponent e is any positive integer. 

The sum (or difference) of two m Xn matrices is defined to be 
the m Xn matrix each of whose elements is the sum (or differ- 
ence) of the corresponding elements of the given matrices. For 
example, 

\c d)-\y d) ^ \c± y d± d/ 

It follows at once that 

U+B^B + A,{A + B) + C^A+{B + C), 
d" tB = t{A B)j tA Hh yA = (t -f" u^A. 

The element [ik] of A (B + C) is + c/*), which is 

the sum of the elements [ik] of AB and AC, In this manner, we 
obtain 

Theorem 3. Multiplication of matrices is distributive, i,e., 

(20) AiB±C) -=AB± AC, {B ± C)A = BA ± CA, 

^Another proof follows from §1. 
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In view of the associative law, the inverse of any product of ma- 
trices is the product of their inverses taken in reverse order. For 
example, 


ABC • C-i A-i - A • BB-^ • = AA-^ = 1. 

If A is non-singular, AX = B has the unique solution 
X' = A"^ B, while 7A = B has the solution Y == BA~^[^ Since 
matrix multiplication is not commutative in general, X and Y are 
usually distinct, so that there are two kinds of division of 5 by A. 


Exercises 

1. Scalar matrices are the only ones commutative with every n 'Kn matrix. 

2. Show that A~^ is the only matrix whose products with A in both orders 
gives the identity matrix. 

3. By multiplication of the matrices 


1 



0 \ . _ / 0 i\ 

0 - V-i/’ ^ “ \-i 0/’ 


■=( 


V - 1 0 /' 


verify that ^ = &? = — J, == fc. By the associative law, 

ifc es iij = — j, Joj = ijj = — hi = k{— hj) = j, 
ji ^Ui - h, jk = i(— ji) = i. 


Any linear combination of I, iy j, k with numerical coeihcients is called a 
quaternion. The two quaternions 


q ^ al A hi cj A- dky q' = al — hi — cj — dk 


are caEed conjugates. Verify that 



X — a 
y = c 


+b^, 

+ d V — if 


and that qq' = q^q = nl, where n = A yy is called the norm of q and 
written niq). 
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4. The conjugate of the product of two quaternions is equal to the product 
of their conjugates taken in reverse order. For, 


/ u 

v\ 

/ r 

s\ 

, , (r 

— S\ 

{ - 


ep = 1 - 


V ^ = (: 

1 

\— V 

u/ 


ri 


Tf 


where r - xu -- yv^ s — xv yu. To find the norm of qp, note that 
qp-p'q' -q*n{p>)I • -qq^-nip)! ^ n{q)n(p)L 
Hence the norm of a product is the product of the norms of the factors. 

5. In matrices (2) and (6), let the pij be themselves matrices, such 

that those appearing in any row have the same number of rows, while those 
appearing in any column, have the same number of columns. Prove that (5) 
holds also here. If each b^J — 0 when i ^ j, find the two products of (a»/) 
and 


25. Characteristic equation. Let A be an n-rowed square 
matrix whose elements are independent of the variable X. Let I 
be the ?^-rowed identity matrix. The matrix A ^ }J is called the 
characteristic matrix of A; it may be obtained by subtracting X 
from each diagonal element of A. The characteristic determinant 
of A is the determinant of A — XI and is a polynomial in X of 
degree n: 

(21) ^(X) = Un X” + an-1 X^~^ (i\\ + Uo, an = ( — I)". 


The equation <i>{\) = 0 is called the characteristic equation of A. 
For example, 


( 22 ) 

<j>{\) = 




6 

d - X 


= X^ — {a (i)X Hh ad — he. 


If in any polynomial /(X) = ^ Ci X® we substitute the matrix 
A for X and multiply the constant term co by J, we obtain a matrix 
denoted by /(A). In the special case (22), we get 
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which is readily verified to be equal to matrix 0. This result is 
the case ^ = 2 of the important 

Theoeem 4. Any square matrix satisfies its characteristic 
equation. 

Let (21) be the characteristic determinant of A, We are to 
prove that 4>(A) =0. Since the elements of A — X/ are linear 
functions of X, and the elements of its adjoint C are (n — l)-rowed 
determinants, they are polynomials in X of degree ^ ^ — 1. If 
the element in the ^'th row and jth column of C is Cijh X* then 

C = ^Ck X*, Ck = {Oiik) {i, i = 1, . . . , n). 


Applying (16) with A replaced by A — X/, we have 


(A - \I)C = ^(\)L 

Hence 

AE C, X* - xE C, X* = E a,. X* J. 

A »0 ^;=0 


Equating the terms free of X and the coefficients of X, X^, . . . , 
X”"!, X”, we get 


ACo 


= Uo J, 

ACi 

- Co 

= ail, 

ACz 

- Cl 

= Chl, 

AC-,,! 

- c„_, 

5 == Un-l 


- Cn-l 

L ~ a-n 1, 


Multiply these equations on the left by I, A, A^, . . . , A^-^, A" 
respectively and add; we get 

0 = uo-f + %A + a2A^+ — + ttn-i A”~^ + Cn A” = 0 (A ) . 
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25. Raiik of a matrix. Every matrix M having more than one 
element contains other matrices obtained from M by deleting cer- 
tain rows or columns or both. In particular, it contains certain 
square matrices. The determinants of these square matrices are 
called the determinants of M. 

A matrix is said to be of rank r if it contains at least one r-rowed 
determinant which is not zero, while all its determinants of order 
higher than r are zero. The zero matrix aU of whose elements are 
zero is said to be of rank 0. 

For example, the rank of the matrix 

a h c d e 

p q r s t 

abode 
p q r s t 

is 2 or less since every three-rowed determinant has two rows alike and is zero. 
The rank is 2 if a, 6, c, d, e are not proportional to p, q, r, s, t, so that a two- 
rowed determinant is not zero. The rank is 1 if they are proportional and not 
all zero. 

By the rank of a determinant is meant the rank of its matrix. 

Theoeem 5. If A is any matrix with n columns and B is any 
matrix with n rows, any t-rowed determinant D of matrix AB is equal 
to a sum of terms each a product of a invowed determinant of A by a 
t-rowed determinant of B, 

Let A = iai^), B = (&^,), P = AB = (p^) be given by (2), 
(5), (6). For the case n = 3, i = 2, the theorem is illustrated by 
the formula 


P12 Pl 3 


an bi2 “b <^12 ^22 cti 3 bz2 ctii biz “b a,n b%z bzz 

P22 P23 


Chi bn + O22 &22 + U23 bz 2 Chi biz + Ch 2 bzz “b Chz bzz 


an <^12 
Chi Ch2 


&13 

622 ^^23 


+ 


0^11 0^13 

Chi ChA 


bi2 biz 
bz^ bzz 


+ 


ai2 aiz 
022 Chz 


622 &23 

&32 &33; 
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Any ^-rowed determinant D of P is of the form 
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Vnh 



n 

Jl-l 

n 

• ®ii3j 

P% tH * ' 

Vii^t 


n 

n 


where ii < 12 < • • • <2^, < ^2 < * • • Since each element 

of the first column is a sum of n terms, D is equal to the sum of n 
determinants of which the jith has as elements of the first column 
the jith terms of those sums and has as elements of the remaining 
columns the elements of D. In other words, we may remove the 
summation sign from the first column and place it in front of the 
symbol of the resulting determinant. The common factor h2^k^ of 
the elements of the first column may be taken out as a factor of 
the determinant. Treating the other columns similarly, we get 


J -1 3 =1 


h 




A = 










Unless jij • ^ • ,jt are distinct, A — 0. Select gi, . . • , from 
1, . . . , TO SO that gi <g2 < ■■■ <gt. If ji = gi, . . . ,jt = gt, 
A is a ]^-rowed determinant a oi A. Next, let ji, ,3 1 be an 
arrangement of gi, — , g ^ which is derived from gi, ,gt^j I 
successive interchanges of two terms. Hence A may be derived 
from a by Z successive interchanges of two columns, so that 
A = (— 1)^ a. The sum of the n\ products (— 1)^ 
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corresponding to all such arrangements ii, . . . , j < is, by definition, 
the expansion of the determinant 

^^ 1*1 * * * 

/5= 

^Oth ‘ * ^ot^t 

which is a determinant of B, Hence D = ^ a where the sum- 
mation extends over all the selections gi, ... ,gt from 1, ... ,n 
such that gi < g 2 < • * • <gf 

t. 

CoEOLLARY. The rank of the product of two matrices cannot 
exceed the rank of either factor. 

For, if all i-rowed determinants of A (or of B) are zero, the 
same is true of aU Crowed determinants of AB. 

Theorem 6. If A is any matrix with m rows and n columns and 
B is any non-singular, n-rowed, square matrix, then A and AB have 
the same rank. If C is any non-singular, m-rowed, square matrix, 
then A and CA have the same rank. 

For, if r is the rank of A and p is the rank of P = AB, the 
Corollary gives p ^ r and, when applied to A = PB^^, gives 
T S p, whence r = p. Next, if r' is the rank of CA, the Corollary 
gives r' St and, when applied to A = • CA, gives r S r', 

whence r' = r. 

27. Bilinear forms. A polynomial in the m + n variables 
Xi, ... , Xm, yi, ... ,yn is called a bilinear form if each of its terms 
is of the first degree in the x^s and also of the first degree in the 
y^8. An example with m = 1, n = 2, is 7xi yi — 5xi y%. The 
general bilinear form is 

m n 

i-l 


( 23 ) 
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We may write 

m n 

a = h V} (i = ■ ■ ■ , m). 

The matrix A = (a^J) of these m linear forms U is called the 
matrix of the bihnear form (23). The coefficient a^J of x^ in a 
is therefore the element in the ^th row and ^’th column of the 
matrix A of a. 

By Theorem 1, the linear transformation (7) with matrix B 
replaces Zi by where (p^k) = AB, Hence that trans- 

formation replaces a by 


m n 

zz Ptk 


whose matrix is AB. 

If we do not alter the but apply the transformation 

m 

(24) Xi = X} Cik h (i = 1, . . . , m) 

k^i 

of matrix C to a in (23), we get 

m n m n 

X) X] 7 “ X/ ^kV 7 j 

i,fc— 1 j==« 1 fc==l 

where dfc/ = Hence djb, is the sum of the products of 

the elements of the fcth row of matrix 



by the corresponding elements of the jth column of A == (a^,), so 
that the new bilinear form has the matrix C' A. Matrix C' is 
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called the transpose of matrix C, being derived from C by employ- 
ing the successive rows of C as the successive columns of C\ 

Theorem 7. If in a bilinear form with the matrix A we subject 
the x^s to a linear transformation (24) with the matrix C, and the y’s 
to a linear transformation with the matrix B, we obtain a new bilinear 
form with the matrix C'AB^ where C' is the transpose of C. 

We may write the bihnear form (23) as follows: 

n m 

ct — ^ ^ yjj Xj* == Gif x% (^' “ Ij , . . j ?2-). 

3=1 

The matrix of the linear forms Xi, . . . , Xn is the transpose A' of 
matrix A, Transformation (24) with matrix C replaces Xi, . . . , X„ 
by linear functions of ?i, , . . , f ^ the matrix of whose coeflEicients is 
A^C (Theorem 1). Hence this transformation replaces a by a 
bilinear form with the matrix (A 'C ) By the result preceding the 
theorem, this matrix must be equal to C'A. Since the transpose of 
A' is A, this process proves 

Theorem 8. The transpose of a product of matrices is equal to the 
product of their transposes taken in reverse order. 

Exercises 

1. Prove Theorem 8 by direct multiplication. 

2. Prove that by means of the transpose of = J. 

3. If AA' = /, then A'A = J. 

4. An mXn matrix (aij) is of rank 0 or 1 if and only if there exist m -hn 
numbers ci, . . . , c^n) di, — , such that an = dj- (t == 1, . . . , m; j « 1, 

7 ^). 

28. Fields. Two forms (each linear, bilinear, or quadratic) 
may be equivalent under a transformation with complex coeffi- 
cients, but not equivalent under one with real coefficients. Again, 
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two iorms may be equivalent under a transformation with real 
coefiSicients, but not under one with rational coefficients. Here, 
as elsewhere in modern algebra, it is necessary to specify carefully 
the nature of the constants employed. To do this briefly and 
clearly, as well as to give full generahty to our theorems, we make 
use of the concept ^‘field” (or domain of rationality). 

A set of complex numbers a + is called a field if the sum, 
difference, product, and quotient (the divisor not being zero) of 
any two equal or distinct numbers of the set are themselves num- 
bers belonging to the set.^ 

The following are examples of sets of numbers each forming a 
field: all complex numbers; all real numbers; all rational num- 
bers; aU numbers of the form a + br with a and b rational num- 
bers and r = i or r = 

But the set of all integers is not a field; nor do all positive real 
(or rational) numbers form a field. 

29, Linear independence.^ The quantities (numbers or func- 
tions) hj , . . ,ln are called linearly dependent with respect to a 
field F if there exist numbers Ci , not all zero, of F such that 

(25) Cl + • • • + Cn Zw = 0. 

If no such numbers Ci exist, h, ... ,ln are called linearly inde-- 
pendent with respect to F. 

It is convenient in proofs by induction on n to employ these 
terms also in case n = 1 . Thus h is hnearly dependent or inde- 
pendent according as = 0 or k 9 ^ 0, respectively. 

^ Further fields satisfy the definition by postulates in the author’s Algebras and 
Their Arithmetics, University of Chicago Press, Ch. XI. An example is the field of 
the residues of integers modulo a prime. For it, our theorems become important 
properties of congruences. 

^We shall not presuppose the theory of systems of linear equations with coeffi- 
cients in any field F, but deduce that theory in §§31, 32 from the present discus- 
sion. For this reason we avoid the shorter proof of Theorem 9 by means of that 
theory. 
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Theorem 9. If Vn ore linearly independent with respect 

to afield Fj the n linear functions 

n 

(26) l^ ^ ai 2 y j = 

j=i 

with coefficients in F, are linearly independent or dependent with 
respect to F according as the determinant A = | a^J | is not zero or is 
zero. 

First let A 0 and assume, contrary to the theorem, that a 
relation (25) holds. Then 

n n n 

EciZi = 0, = 0 (j=l,...,n). 

i, 2—1 ^=l 

Hence A = 0, and every c^ = 0, contrary to hypothesis. 

Conversely, if Zi, , . . , are linearly independent, then A 0. 
We shall give a proof by induction on n, noting that it is evidently 
true when n == 1. If = 0, 1 * == 0 would be a relation (25). 
Hence after permuting the we may assume that ann 9 ^ 0. 
Since our converse will follow if proved for h, , In-i, anrT^ L, we 
may assume that ann = 1. Then 


71-1 

Zi ~ 1% airfin ~ j {an a^n ^nj^y f (i ~ 1, 1) 

3=1 

are linearly independent linear functions of 2 / 1 , since 

n— 1 71—1 

^ ^ Ci Z I — ^ ^ C £ (Z I am In) ~ 

implies that each = 0 in view of the linear independence of 
li, ... j In- Hence by the hypothesis for the induction, the deter- 
minant 


D — ai2* ain an 2 (fjj Ij ... j n 1) 
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is not zero. If in A we add to the first, second, . . , , (n ~ l)th 
columns the products of the elements of the last column by 
— Uni, an2, . . . , — ann-i, respectively, we get a determinant 
whose last row is 0, 0, . . . , 0, 1 and having D as the cofactor of 1. 
Hence A == D 5*^ 0, 

Theobem 10. If yij y yn are linearly independent with respect 
to Fy and hj . • , ylm CLre any linear forms in yi, • • • ,yn with coeffir 
dents in F, we can select certain of the VSy say 

— Ik^ (i = 1, . • . , r), 

which are linearly independent with respect to F and are such that the 
m — r remaining Vs are expressible linearly in terms of ^7i, . . . , rjr 
with coefficients in F, 

As rji we may take any which is not zero identically in the 
^^s. If every is a product of rji by a number of F, the theorem 
holds with r = 1. In the contrary case, take as any which is 
not such a product, whence rji and 7]2 are linearly independent. If 
every Zy is a linear combination of m and with coefl&cients in F, 
the theorem holds with r = 2. In the contrary case, take as 173 
any which is not such a combination, etc. 

The value of r is given by Theorem 13. 

Theorem 11. If r <n in Theorem 10, there exist linear forms 

rjr+iy * • • , rjninyi, . . . j yn with coefficients in F such that rji, , rjn 

are linearly independent with respect to F, 

First, 2/1, • . . , 2/n are not all linear functions of 971, , 7}^ with 
coefficients in F. For, if so, we have in particular 

r 

(27) yi = tik rjk (f — 1, , , . , 7*). 

Hence by Theorem 9, ] Zi;bl ^ 0. We may therefore solve (27) 
and obtain 97 a; as a linear function of 2/1, , Vr with coefficients 
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in F. By hypothesis, yiii > r) is a linear function of the and 
hence of 2 / 1 , . . . . , yr with coefficients in F. This contradicts the 
linear independence of yi, . . . , yn with respect to F. 

Hence not all linear functions of yi, . . . , yn with coefficients in 
F are linear functions of . . . , with coefficients in F. Thus 
there exists a linear function rjr+i of the y’s such that 771 , • . . , yr? 
yr+i are linearly independent with respect to F. If r +- 1 = n, the 
theorem is proved. If r + 1 < we repeat the argument in the 
first part with r replaced by r + 1 and conclude that yi, . . . , y^ 
are not all linear functions of yi, . . . , yr+i with coefficients in F. 
Thus there exists a linear function Vr +2 of the y^s such that yi, . . . , 
7]r+2 are linearly independent with respect to F. This proves the 
theorem if r + 2 = n. If r + 2 < n, we repeat the argument. 

30. Equivalence of two matrices or two bilinear forms. Two 
bilinear forms a and ai in , Xm, yi, . . . , yn, with coefficients 

in F, are called equivalent in F if and only if a becomes ai when we 
subject the r’s in a to a linear transformation with non-singular 
matrix C and the y’s to a linear transformation with non-singular 
matrix B, the elements of C and B being in F. Then also the ma- 
trices A and Ai of a and ai are called equivalent in F. By Theorem 
7, Ai = C'AB. In other words, two m X n matrices A and Ai 
with elements in F are called equivalent in F ii and only if there 
exist two non-singular square matrices^ D and B with elements 
in F and having m and n rows, respectively, such that DAB = At. 

Theorem 12. Two matrices {or bilinear forms) in F are equivor 
lent in F if and only if they have the same rank. 

If the matrices (or bilinear forms) are equivalent, they have the 
same rank by Theorem 6. 

Conversely, if a and ai are any bilmear forms in F with the same 
rank, they are equivalent in F. This is evident if the rank is zero, 
whence a = 0, ai = 0. Consider therefore a bilinear form a, 

^ We Iiave written D for the earlier C'. Conversely, given i>, we get C == Z)'. 
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m w 

^ ^ ~ ^ ^ij Vi 

^ =1 2 =1 


(i = 1, . . . , m), 


in which the a^^ are numbers of F not all zero. By Theorem 10, we 
may select certain of the Z's, say rji — hi (i = 1, . . . , r), which 
are linearly independent with respect to F and such that the m -- r 
remaining Vs are linear functions of 971 , . . . , 77 ^ with coefficients in 
F. Since ki, . , , ykr are distinct, we may select m -- r further 
Fs so that ki, . , . ,km form a permutation of 1, . . . , m. Also for 
i > r, denote Ik^ by 77^. Introduce the new variables 

Xi = Xki (t = 1, . . . , m). 

Then 

m m r 

OL ~ Xk^h^ “ X % ^ ^ V 3 ^ ^ j • • • > ^ ) > 

is*l ^=1 j*»l 

where the d ^3 belong to F. Hence 

r m r r 

oi X Q 3 "4“ ^ Xl. % ^ d-i j 97 2 ■ ^ 3 * V 33 

3-=l 3=1 3=1 

if we write 


= Zj cZi, Z» (j = 1, . . . , r). 

i—r+l 


The latter and == Zi(i = r + 1, . . . , m) give m linearly inde- 
pendent linear functions ^ 1 , . . . , of Zi, . , . , Xm with coeffi- 
cients in F, Hence the ^’s are linearly independent linear func- 
tions of oji, . . . , Xm with coefficients in F. The determinant of the 
matrix M of the coefficients of rci, . . , , in the ^'s is not zero by 
Theorem 9. If r < n, there exist, by Theorem 11, linear functions 
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?r+i, . . . , tn of ?/i, * . . , with coefficients in F such that ) 7 i; . . , , 
Vn Tr+i? - . . , f « are linearly independent with respect to F. If 
r = n, we suppress the f ^ and employ , 97 ^. In either case, 

the determinant of the matrix N of the coefficients of ^ 1 , , yn 
in these n linear functions is not zero by Theorem 9. 

We have now proved that a can be reduced to the canonical 
form 

(28) 


by a linear transformation expressing the x^s as linear functions 
of the f^s with the matrix M~^ and a linear transformation express- 
ing the y’s as linear functions of the and f ’s with the matrix 
N'^K The matrix of (28) is evidently of rank r. By the first part 
of our theorem, also a is of rank r. 

Since a and ai are both equivalent to (28), they are equivalent 
to each other. 

We have also proved the first part of 

Theorem 13. The r of Theorem 10 is the rank of the matrix of the 
coefficients of h, , Im- Moreover, the rank of the matrix T of the 
coeffikients of tju • • * j Vr is r. 

Let p be the rank of T. Apply the first part of the theorem to 
rjit • • • fVr in place of Zi, . . . , Im- Hence if r > p, r — p of the 
are expressible linearly in terms of p of them with coefficients in 
F, contrary to the linear independence of 771 , . . . , Tjr with respect 
to F. 

Corollary 1. If m > ?^, any m linear forms in n independent 
variables with coefficients in F are linearly dependent with respect 
to F. 

For, the rank r is evidently g n and hence < m. Then by 
Theorem 10, m — r of the forms are expressible in terms of r of 
them, so that the m forms are linearly dependent. 
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Note that, conversely, if the m forms are linearly dependent, 
then the rank r is less than m. Hence we have 

CoEOLLARY 2. Any m linear forms in n independent variables 
with coefficients in F are linearly dependent with respect to F if 
and only if every ?? 2 ^rowed determinant of the matrix of the 
coefficients is zero. 


Exercises 

1. Find linear transformations which reduce 

2x1 2/2 - 3 xi2/3 + 4x 2 2/1 - 2x2 2/3 - 8x3 2/i + Xzy% to ?i m + I2 m- 

2. 6x + 82/ — 6 , 8 x + 152 , 6 x 4 - 22 / + 5 s — 4 are linearly dependent. 

3 . When F is the field of all real numbers, restate Theorem 12 without 
using explicitly the notion of a field. 

4 . Any matrix of rank r is equivalent to one whose element in the ith row 
and ith column is 1 when i ^ r, while all further elements are zero. 

5 . Any matrix of rank r can be expressed as the sum of r matrices of rank 
unity. Hence any bilinear form of rank r can be written as the sum of r 
products each of a linear form in Xi, . . . , x„i by a linear form in 2/1, , 2/n. 

6. When a bilinear form is the sum of r products each of a linear form in 
xi, . . . , Xm by a linear form in 2 / 1 , ... , 2/n, the matrix of the form is of rank r 
if and only if the r linear forms in the x’s are linearly independent and the r 
linear forms in the y^s are linearly independent. 

7 . If r is the rank of an n-rowed square matrix and if is the rank of the 
matrix composed of the first t rows of the former, then R ^ r + t ^ n. 

8. If n and r2 are the ranks of two n-rowed square matrices, and if R is the 
rank of their product, show that E ^ n 4- r2 — n. Hint: Ilrst prove this 
when the left factor is of the form in Ex. 4 . 

31. Homogeneous linear equations. Consider a system of 
equations = 0, . . . , Zm == 0 in the unknowns 2/ij • • • , 2/n with 
coefficients in a field F and of rank r. After rearranging the equa- 
tions, we may assume by Theorem 13 that h, . , . ,lr are linearly 
independent and have a matrix M of rank r, while . . . , 
are linear combinations of Zi, . , . , Ir with coefficients in F. Some 
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r-rowed determinant of M is not zero. After relabeling tbe we 
may therefore assume that the determinant D of the coefficients of 
2 / 1 , . . . , 2 /r in Zi = 0, . . . , Zr = 0 is not zero. Transpose to the 
second members the terms involving y^ij > r) and apply the 
ordinary method of solution by determinants. We obtain 

(29) yi = Cij y j (i = 1, . . . , r), 

j — r+l 

where the Cij are in F, When these values of yi, , yr are 
inserted in Zi, . . . , we therefore obtain expressions in yr+i , . . . , 
2/n which are zero for aU values of the latter. The same follows for 
Ir+u * • * , Im since they are linear combinations of Zi, . . . , Ir. 

Theorem 14, Given m homogeneous linear equations in n un^ 
knowns whose coefficients belong to any field F and have a matrix of 
rank r, we may select r of the equations so that their matrix has a 
non-vanishing r-rowed determinant. These r equations determine 
uniquely r of the unknowns as homogeneous linear functions^ with 
coefficients in F, of the remaining n — r unknowns. For all values of 
the latter^ the expressions for the r unknowns satisfy the given m 
equations. 

Corollary. There exist solutions not all zero if and only if 
T <n and certainly if ??^ < n. In particular, n homogeneous linear 
equations in n unknowns have solutions not all zero if and only if 
the determinant of the coefficients is zero. 

If the homogeneous linear equations Zi = 0, . . . , Z^ = 0 have 
the two solutions (^^i, . . . , Un) and (vi, . . . , Vn)y they evidently 
have the solution {gu^ + hvi, — , gUn + hvr,), in which g and 
h are any numbers. We call the two solutions linearly dependent 
if there exist constants g and A, not both zero, such that gui + hvi 
= 0 (i = If ... yU); but linearly independent if no such constants 
g and h exist. Analogous definitions apply to more than two 
solutions. 
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For example, the single equation 2yi + 3^/2 5^3 = 0 has the 

linearly independent solutions 

(wi, uz) = (- 3/2, 1, 0), {vu V2, vz) = (5/2, 0, 1), 
while every solution (t/i, y 2 , Vs) is linearly dependent on them: 

Vi = y2Ui + yz vi, 2/2 = y2U2 + yz V2, yz == y2Uz + yz vz. 

This example illustrates the following 

Theorem 15. Any system of m homogeneous linear equations in 
2/i? • • • 7 'f'ojnk r < n has n — r linearly independent solutions^ 
while every solution is linearly dependent on them. 

With the notations (29), we may take as the n -- r solutions 

(30) (ci,*, . . . , Crj, 0, . . . , 0, 1, 0, . . . , 0) (y = r+1, . .. ,n), 

where 1 is in the yth place. Multiply the numbers in (30) by y^j 
sum as to j, and apply (29) ; we get the solution 

y 7 ^1? ~ Vh • • • ? 7 ^rj — yrt Vr+lj • • * ; Vn)} 

in which the summations extend from j = r+ l to j — n. Since 
these numbers are all zero if and only if 2/r+i, • • • yVn are all zero, 
the solutions (30) are linearly independent. 

32. Non-homogeneous linear equations. Consider a system of 
m equations = Ai, . . . , = /bm, where the are constants and 

h, ... jlm are linear forms in yi, . . . , t/n whose matrix A is of rank 
r. After rearranging the equations, we may assume that h, . . , ,1^ 
are linearly independent, while 

r 

li = Ci] li {i = r + 1, . . . , m). 


(31) 
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Hence our m equations U = ki are inconsistent unless 

r 

(^2) ki — ^ j Cijkj “ 7* Ij • . . j 

i=i 

Write Xt = U — kiy^+i, where yn+i = 1. Since h, . .. ,lr are 
linearly independent, the same is true of Xi, . . . , K- Next, 

r 

^3 (i = r + 1, . , . , m) 

if and only if conditions (32) hold, and then the matrix of Xi, . . . , 
\m is also of rank r by Theorem 13. 

Theorem 16. Let r he the rank of the matrix A of the homogeneous 
linear functions hy * ,lm of yi, , yn» Let B be the augmented 
matrix derived from A by annexing a column having the elements 
kiy , , , , kmy so that the rank p of B is ^ r. T,he equations li = ki, 

. . . , Ifn — km are inconsistent if p > r. If p = r, certain r of the 
equations determine uniquely r of the unknowns as linear functions 
of the remaining n — r unknowns; for all values of the latter ^ the 
expressions for these r unknowns satisfy also the remaining m -- r 
equations. 



Chapter IV 


QUADRATIC AND HERMITIAN FORMS, SYMMETRIC 
AND HERMITIAN BILINEAR FORMS 

Oiir discussion of these four types of forms requires but little 
more space and effort than are necessary for a treatment of 
quadratic forms alone. There has been a steady increase in the 
importance of the forms named after Hermite. Our results on 
quadratic forms suffice both for their metric and their projective 
classification. 

33. Symmetric bilinear forms and quadratic forms. The form 

n 

( 1 ) 

is called a symmetric bilinear form if it remains unaltered by the 
interchange of Xi and yi for ^ = 1, , . . , and hence if 

(2) CLji (l{j‘ (ij j = , J 7l). 

The matrix A = (an) of a then remains unaltered by the inter- 
change of rows and corresponding columns, and is called a sym- 
metric matrix. In other words, A is symmetric if and only if it is 
equal to its transpose A\ 

By Theorem 7 of Ch. Ill, the transformation 

(3) Xi “ t)%3 ^ j (i 1, , . . , 

with the matrix B == (bn), and the cogredient transformation 

(4) yi=^bijYi (z = l, 

y-l 
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with the same matrix Bj replace (1 ) by a bilinear form ai, whose 
matrix is Ai = B'AB, If A' = A^ then A\ = B'A^B = Ai by 
Theorem 8 of Ch. Ill, whence also matrix Ai is symmetric. 

Any two n-rowed square matrices A and Ai with elements in a 
field F are called congruent in E if there exists a non-singular n- 
rowed square matrix B with elements in F such that Ai = B'AB, 
By Theorem 6 of Ch. Ill, congruent matrices have the same rank. 
Recalling the definition of equivalence in §30, we have 

Theorem 1. Two symmetric bilinear forms with coefficients in a 
field F are equivalent under non-singular cogredient transformations 
(3) and (4) with coefficients in F if and only if their matrices are 
congruent in F, 

If we identify yi with for i = 1, . . . , n., a S 5 unmetric bilinear 
form (1) becomes a quadratic form 

n 

(5) 2 = X ^3> O'ii = O'ih 

i, j«l 

having the same symmetric matrix A = A\ For n = 2, 
q = an + 2 ai 2 iCi + U 22 x^. 

The rank and determinant of A are called the rank and determi- 
nant of q. 

If also in (4) we identify yi with Xi^ and F, and Z,, so that (4) 
becomes (3), we conclude that transformation (3) with the matrix 
B replaces a quadratic form (5) with the symmetric matrix A by a 
quadratic form with the symmetric matrix B'AB, 

Hence we have 

Theorem 2. Two quadratic forms with coefficients in a field F 
are equivalent under non-singular transformation with coefficients in 
F if and only if their matrices are congruent in F. 

A comparison of Theorems 1 and 2 yields 
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Corollary 1. Two symmetric bilinear forms with matrices A 
and A I are equivalent under cogredient transformations if and 
only if the quadratic forms with matrices A and Ai are equivalent, 
where the forms and equivalence relate to the same field F. 

Hence these two problems of equivalence are not essentially 
different and each may be treated by a study of the congruence of 
symmetric matrices. 

34. Hennitian forms and Hermitian bilinear forms. Let S be 
any field containing a number v which is not the square of any 
number of S. Then all the numbers a; == a + in which a 
and b belong to S, form a larger field F. Call x = a — bv^ the 
conjugate of x. 

An important example is that in which S is the field of all real 
numbers, and v = — 1. Then F is the field of all complex numbers, 
and X and x are conjugate imaginaries. 

The bihnear form, with coefficients in F, 

n 

( 6 ) ^a^j-Xzy,, a,i = aii, 
i, ]— I 

is called an Hermitian bilinear form in F. 

If we replace each element of a matrix A by its conjugate, we 
obtain a matrix denoted by A and called the conjugate of A, Hence 
the matrix A of (6) has the property 

(7) A' = A, 

and is called an Hermitian matrix. It is seen at once that 

(8) M = M, MN = MN, = 

By Theorem 7 of Ch. Ill, transformation (3) with the matrix 
B and the conjugate transformation 
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( 9 ) (* = 1 , • • • , 

1==1 

with the conjugate matrix S, replace (6) by a form whose matrix 
is iii = B'AB, Then (7) implies 

Ai' = B'A'B = Ji. 

Theorem 3. A linear transformation with matrix B and the con-- 
jugate transformation replace an Hermitian bilinear form with the 
Hermitian matrix A by an Hermitian bilinear form with the 
Hermitian matrix B'AB, 

Any two n-rowed square matrices A and Ai with elements in a 
field F shall be called conjunctive in F if there exists a non-singular 
n^rowed square matrix B with elements in F such that 
Ai = B'AB, By Theorem 6 of Ch. Ill, conjunctive matrices 
have the same rank. Theorem 3 implies 

Theorem 4. Two Hermitian bilinear forms with coefficients in 
the field F are equivalent under a non-singular transformahon and its 
conjugate with coefficients in F if and only if their matrices are con- 
junctive in F, 

If in (6) we identify yj with 5/, we get the Hermitian form 

n 

( 10 ) 'Z.aijXiXj, a}, = aij. 

if J—1 

This form is equal to its conjugate. If we also identify Y j with 
Xj, we see that (9) become equations which foUow from (3) by 
taking conjugates. Hence Theorem 3 implies 

Theorem 5. A linear transformation with the matrix B replaces 
an Hermitian form with the Hermitian matrix A by an Hermitian 
form with the Hermitian matrix B'AB, 

This implies the following analog of Theorem 4: 
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Theokem 6. Two Hermitian forms with coefficients in the field F 
are equivalent under a nonsingular transformation with coefficients 
in F if and only if their matrices are conjunctive in F. 

A comparison of Theorems 4 and 6 yields 

CoBOLLARY 2. Two Hermitian bilinear forms with matrices A 
and Ai are equivalent under a non-singular transformation and its 
conjugate if and only if the Hermitian forms with matrices A and 
Ai are equivalent, where the forms and equivalence relate to the 
same field F. 

Hence these two problems of equivalence are not essentially 
different. From this and the corresponding result in Corollary 1, 
we see that the conclusions in our further study of two quadratic 
or two Hermitian forms imply corresponding conclusions concern- 
ing two symmetric or two Hermitian bilinear forms. 

Not only have the latter two problems been reduced to the 
former two, but the former two may be treated together by the 
simple device next explained. 

35. Rational reduction of quadratic and Hermitian forms. To 
each element a of the field F we make correspond an element a of 
F such that 

(11) a = a, ag^^, a±g-=^a±g. 

Hence to a matrix A with elements Ui, in F corresponds a matrix 
A whose element in the ^th row and jth column is aij. Hence, as in 
(8), we have 

(12) 1 - A, AQ^AQ, (I)' = A'. 

If F is obtained as in §34 from a subfield S by adjoining and 
if we take a to be 5 for every a in F, we see that relations' (11) 
hold. They evidently hold also if we take a to be a itself. These 
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two cases are the ones needed in Hermitian and quadratic forms 
respectively, as we shall next explain in detail. • 

Consider the form, with coefficients in F, 

n 

^-'2 Cltj X jj (^2% ~ 

i, 

whose matrix A therefore has the property 

(14) - 1. 

If a = a for every a in F, and if x,- = (13) becomes the Hermi- 

tian form (10) whose matrix A is Hermitian and hence has the 
property A^ = A. But if a = a for every a in F, and if = Xj, 
(13) becomes the quadratic form (5) whose matrix A is symmetric 
and hence has the property A' - A. Hence the form (13) fur- 
nishes a generalization of Hermitian and quadratic forms, which 
will be seen to effect a unification of the two theories of those forms. 

Theorem 7. By a non-singular transformation^ with coefficients 
in afield F, any form (13) of rank r with coejfficients in F can be re- 
duced to one of type 

(15) giXiXi + ' ' * + grXr Xr {eoch gi 9^ 0, gi = gi). 

We shall first transform (13) into a like form lXiXi-\- • • • 
having Z 0. If an = 0 and auk 9 ^ 0, the transformation 

Xx — Xky Xk Xi, Xi = Xi (i 7^ 1, k) 

replaces (13) by akkXiXi + If every a« = 0 and 9 ^ 0, 

the transformation 

Xx = Xi, X 2 = gXx + X^i Xi == Xi (i > 2) 

^ ^ Transforaisition, (3), of matrix J?, together with th^ induced transformation 
Xi = X hi Xj. The matrix of the resulting form is B'AB and has the same rank 
as by Theorem 6 of Ch. HI. 
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replaces (13) by IXx + . . • , where I — an g + Taking 

g — 1 / 021 , we have 1 — 2. 

It remains to consider a form a of type (13) having On 7 ^ 0. 
Then 

1 n n 

Cl — — (on Xi + ^ 0,1 a;,) (oii xi+'^aij Xj) 

<^11 i ^2 j-=2 

+ , iCn, ^2, . . . , ^n). 

Denoting the quantity in the first parenthesis by an Xi, we see 
that that in the secon(i parenthesis is equal to an Xi, and hence 
that the transformation 

1 ^ 

a;i = Xi X) Xk = Xk (k > 1) 

replaces a by 

+ <f»(Z2, . . . , Zn, Z 2 , . . . , Zn). 

Since the matrices of a and an Xi Jti have the property (14), the 
matrix of <j!> has that property. We may therefore proceed with <j> 
as we did with (13). Ultimately we obtain (15). The number of 
its terms is evidently its rank and hence is the rank of (13) by the 
preceding footnote. 

CoEOiiLARY 3. Within any field any quadratic form of rank r 
is equivalent to 

(16) gixx^ + b Qr x? (each g, 0). 

56. Canonical forms. When ¥ is the field of all complex num- 
bers, (16) is reduced to + • • • + by the transformation 

Xi = U (i = 1, . . . , r). 

Theorem 8. Within the field of all complex numbers, every quad-- 
ratio form of rank r can be reduced + * • * + f 62/ ^ 
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singular linear transformation. Hence two quadratic forms are 
equivalent if and only if they are of the same rank. 

From this theorem and Corollary 1 we have 

CoROLLABT 4. Two Symmetric bilinear forms are equivalent 
under non-singular cogredient transformations with complex 
coefficients if and only if they have the same rank r. Each can be 
reduced to + • • • + 

Theokem 9. Let F he the field of all real numbers or the field of all 
complex numbers. In the respective cases j take d = a or d for every 
a in F. By a non-singular transformation with coefficients in P, any 
form (13) of rank r with coefficients in F can be reduced to 

(17) yiyi+ Vypyv- Vp+i - - ' - Vvyr, 

where the index p is uniquely determined. Hence two forms (13) are 
equivalent in F if and only if they have the same rank r and the same 
index p. 

Since gi = gi in (15), each g^ is real. After applying to (15) the 
transformation which permutes certain of the x’Sy we may assume 
that ^ 1 , . . . , Pi. are positive and gjij > p) is negative. Write 
— djij > p). Then the transformation 

Xi=-g,-^yi {i = l,...,v), 

Xj = dj-iy,. (j = P + 1,... , r), 

with real coefficients, replaces (15) by (17). Hence any form 
(13) of rank r with coefficients in F can be reduced by a non- 
singular transformation with coefficients in F to one of type (17). 
Suppose that it can be reduced also to 

(18) il “f" • • • “h i g — 2g+l Sg+I — ’ * * — Sr 

We shall prove that p = q- Assume that p > q- By hypothesis, 
(17) and (18) are each equal to the given form (13) when the y’s 
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and s’s are certain linear functions of Xi, j with coefficients 
in F. Hence (17) and (18) are equal for all values of the 
The system of q + n- 'p<n homogeneous linear equations 

(19) zi = 0, . . . , Zg = 0, Vp+i = 0, . . . , 2/n == 0 

in the n unknowns Xi, . , , ,Xn has solutions Xi = Xi, , , , , Xn = Xn 
in jPj not all zero, by the Corollary to Theorem 14 of Ch. III. Let 
Y^ and be the corresponding values of y^ and Zi respectively. 
Equating the resulting values of (17) and (18), we get 

Tl Fl + • * • + Yp Yp = Z — Zr Z r» 

The terms on the left are zero or positive, while those on the right 
are zero or negative. Hence every term is zero. In particular, 
7i, . . . , are zero. In view also of (19), we see that the system 
of equations yi = 0, . . . , t/n = 0 has the solutions Zi, . . . , 
not all zero. Hence the determinant of the coefficient of the x's in 
Vh • ^ * iVn is zero by the Corollary cited. In other words, the 
transformation which reduced (13) to (17) is singular, contrary 
to hypothesis. 

Separating the two cases of Theorem 9, we have 

Corollary 5. Any real quadratic form of rank r can be re- 
duced by a non-singular real transformation to 

(20) yi^+ b 2// - yv\i 2/r^ 

where the index p is uniquely^ determined. Hence two real quad- 
ratic forms are equivalent under non-singular real transformation 
if and only if they have the same rank and the same index. 

For r = 2, the types are yi^ + ~ 2 / 2 ^ - ”■ 2 / 2 ^* 

^Discovered independently by Jacobi, Jour, fiir Math., 53, 1857, 275, 281; 
Werke, 4, 1884, 593, and Sylvester (who called the uniqueness of p the law of inertia 
of quadratic forms), Phil. Mag., (4), 4, 1852, 140; Phil. Trans., London, 143, 1853, 
481; Jour, fiir Math., 100, 1887, 465; CoU. Math. Papers, I. 1904, 376-7, 511; IV, 
523. 
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Corollary 6. Within the field of complex numbers, any 
Hermitian form of rank r can be reduced to 

( 21 ) yiyi yp Vp 2 / p+i yp+i * * * Vryr} 

where the index p is uniquely determined. Hence two Hermitian 
forms are equivalent if and only if they have the same rank r and 
the same index p. 

The rank r of a quadratic or Hermitian form / is the maximum 
order of a non-vanishing determinant of the matrix of / and hence 
may be found from / without resort to the canonical form. A 
corresponding direct method of finding the index p is given in 
Theorem 22. 

Corollaries 1, 2, 5, 6 imply 

Corollary 7. There is a unique determination of p in the 
canonical form 

Xiyi+ + Xpyp- Xp^i 2/^+1 - ' - — XrVr 

to which a real symmetric biUnear form can be reduced by non- 
singular real cogredient transformations, or to which a Hermitian 
bilinear form with complex coefficients can be reduced by a non- 
singular transformation on the x’b and the conjugate transforma- 
tion on the y’s, 

A real quadratic form is called positive if its index and rank are 
equal, and negative if its index is zero. Both are called definite 
forms. A positive form of rank r is equivalent under real trans- 
formation to yi^ + • • • + yj^y while a negative one is equivalent 
to — yi^ - ... - 

Positive and negative Hermitian forms are defined similarly. 
They are equivalent to (21) with p = r and p = 0, respec- 
tively. 
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37. Reduction of a real quadratic form by real orthogonal 
transformation. We shall prove simultaneously the two compan- 
ion theorems: 

Theoeem 10. If f and h are real n-ary quadratic forms and if h 
is positive and of rank n, the pair /, h can be reduced by a real non- 
singular linear transformation to 

/ = Xi + . . • + X. /, = 4. . . . + ^ 2 ^ 

where Xi, . . . , Xn are the roots of \ f — \h \ — 0 and are all real, 

Theoeem 11. If f and h are n-ary Hermitian forms and if h is 
positive and of rank Uy the pair fj h can be reduced by a non^singular 
linear transformation to 

/ = Xi 11 + * ' * + x„ In In, A = |i |i + * ‘ • + In In, 

where Xi, . . . , Xn are the roots 0 / | / — X/i | = 0 and are all real. 

Let A = (a^) and B — (6i,) be the matrices of / and h. Let 
Xi be any root of j A — X5 | = 0. Then the system of n linear 
equations 


n n 

aij 2 / j “ Xi ^ ^ bij Xj ~ 1, • . • j n') 

become after transposition of terms a system of n homogeneous 
linear equations in the Xf whose determinant is | A — Xi jB | = 0, 
and hence have solutions Xi, , , , yXn not all zero. Then 

f a^ij x% X j ^ i Xi b %j X 2 Xj. hj h b i] x ^ x 

if i i i i, 7 

First, let / and h be Hermitian. Then h 9 ^ Qhj the assump- 
tion that h is positive and of rank n. Since / and h are real, 
also Xi is real. 
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Next, let be a real positive quadratic form. Consider the 
Hermitian form H = x^xt having the same coefficients as 

A. Write Xg ^ + ivs, where ? and tj are real. Then H is the 

sum of which by hypothesis is positive unless the ^^s 

are all zero, and Va Vtj which is positive unless the rj’s are 

all zero. Hence H is positive unless the x^s are all zero. Thus 
Xi is real by the JBrst case. 

Thus / — Xi A is a real quadratic form or a Hermitian form in 
the respective cases. Since its determinant is zero, it can be 
reduced by Theorem 7 to a real quadratic or Hermitian form 
, 2/n) lacking one variable yi by a non-singular linear 
transformation which is real in the quadratic case. Let the same 
transformation replace A by H, Hence it replaces 

/- XA XiA+ (Xi X)A 

by 

(22) f - \h m F{y2,. . . ,yn) + (Xi - \)H (yi, y^). 

In the positive form H of rank n, the coefficient of yi yi is not 
zero, since otherwise H would vanish when yi = IjVk — 0{k > 1). 

Hence by the last part of the proof of Theorem 7, there exists a 
non-singular linear transformation which leaves each yk(k > 1) 
unaltered and replaces IT by a form Cigigi + ^( 2 / 2 , . . . , 2/»)? 
where Ci is positive. This transformation, which is real in the 
quadratic case, replaces the second member of (22) by 


P(y2) • • • j Vn) + (Xi ““ X)ci gi gi + (Xi — X)^(?/2, . . . , 2 /n)« 
Write <;» for F + Xi Hence 
/ — XA = ^( 2 / 2 ; . . . , + (Xi - X)ci gi gi - Xi^(y2, . - . , 

Equating the terms free of X and those containing X, we have 
/ = \iCigigi + 0(2/2, • . -^yn), A = Cigigi + 0 ( 2 / 2 , . . . , yn)- 
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By the hypothesis on h, evidently 4' is positive and of rank 
— 1 . Hence the discussion made for /, h, and \i is applicable to 
4^, and X 2 and leads to the segregation of terms X 2 C 2 gi and 
C 2 g% 5 ^ 2 . Repetitions of this argument show that / and h are re- 
ducible, by a non-singular transformation which is real in the 
quadratic case, to forms 

S = ^ ^ Xj % Qzj Ifi — Ct Qt g if 

in which the Ci are all positive. The transformation 

f i c^^g, (i == 1, . . . , n) 

is real and non-singular, and yields the canonical forms given in 
Theorems 10 , 11. 

As a special case of Theorem 10, we may take A to be + • • • 
+ Xr?, Then the linear transformation leaves h unaltered and is 
called orthogonal. Similarly, if in Theorem 11 we take A to be 
Xi xi Xn Xnj the transformation leaves h unaltered and is 
called unitary. Theorems 10 and 11 therefore imply (and are really 
no more general than) the following two: 

Theobem 12. If S is a real quadratic form of matrix A, the roots 
Xi, . . . , Xn of its characteristic equation | A — XI j = 0 are all real, 
and f can be reduced to ^ X^ by a real orthogonal transformation. 

Theorem 13. Iff is any Hermitian form of matrix A, the roots 
Xi, . . . , Xn of its characteristic equation | A — XI | = 0 are all real, 
mdf can be reduced to unitary transformation. 


Exercises 

1. A linear transformation with matrix B is orthogonal if and only if B' B 
= I and is unitary if and only if R' S = /. Hint: Take A = Ai = I in 
§§33, 34. 

2. The determinant of an orthogonal transformation is ± 1; that of a uni- 
tary transformation has the absolute value 1. 
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3. The product of two orthogonal transformations and the inverse of one 
are orthogonal. Likewise for umtary. 

4. If a linear transformation has two of the properties (i) is real, (ii) is 
orthogonal, (iii) is unitary, then it has the third property. 

5. Every real orthogonal transformation on x and y is of the form 

x' — X cos a ~ y sin Of, = ± (x sbj. a y cos a). 

6. The matrix of every unitary transformation of determinant unity on 
two variables is of the form 


C D- 

7. The transformation from one system of rectangular coordinates to 
another having the same origin is orthogonal. Then by means of Theorem 12 
show that the equation of any real quadric surface can be reduced to a form 
in which the terms of the second degree are ax^ 4- hy^ ± cz^j ax^ ± hy^j or 
ax^, where a, &, c are positive. 

8. In the canonical form (20), there are p positive and N — r — p negative 
coef&cients. The number p — N is called the signature. Two real quadratic 
forms are equivalent under real transformation if and only if they have the 
same rank and same signature. 

38. Rank of a symmetric or Hermitian matrix. 

Theoeem 14. If a certain r-^rowed determinant d of a matrix is 
not zero and if all (r + 1 )-rowed determinants having d as a first 
minor are zero, then the matrix is of rank r. 

Note that this theorem shortens the work of finding the rank r, 
since we need not prove that all (r + l)-rowed determinants are 
zero, but merely those having d as a first minor. 

After interchanges of rows or columns or both, we obtain a 
matrix 


<^11 * 

* * ain 

<Zml * 

amn 
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in which 


dll * * 

Ulr 

drl • * 

• drr 


0 . 


The theorem will evidently foUow if proved for matrix A and 
determinant A. Write 


1% — C-zl 2/l H“ Clin Vm 

m). 

Xi = 7/i + • • • + dir Vr, 

For s > r,t > r, the determinant of order r + 1 of the forms 

Xt + a^tVt (^ = 1, . . . , r, s) 

is zero by hypothesis. Hence these r + 1 linear forms are linearly 
dependent by Theorem 9 of Ch. Ill, so that 

r 

(23) X 5 + ~ S ^»(Xi + y *)• 

t=l 

Since A 0, we may solve equations X^ = yi + • ' • and 
express yi, . . . , linearly in terms of Xi, . . . , Xr. Substitution 
in Xa gives 

Xa = 61 Xi + • • • + Xr. 

Inserting this into (23) and noting that Xi, . . . , Xr, y « are linearly 
independent since yi, . . . , yr, y« are, we see that 

r 

Ci = bi (z = 1, . . . , r), Uai = 23 


I, = X, + '^a.tyt = ^hi (X,- auyt) = ^bili. 


Then 
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Since ?r+i; • • • ylm are therefore linear functions of lij * ,lr, 
which are linearly independent, matrix A of the is of rank r by 
Theorem 13 of Ch. III. 

We shall next prove theorems which enable us to find the rank 
of a matrix A having the property A' = A by an examination of 
its principal minors only, A 'principal 'minor of any square matrix 
A is the determinant of the matrix obtained from A by deleting the 
same rows as columns; in other words, its diagonal elements aU lie 
in the diagonal of A. 

Theorem 15. The rank of a symmetric or Hermitian matrix A 
is r if an r-rowed principal minor pr is not zero and if zero is the valw 
of every principal minor obtained by annexing to pr one row and the 
same column of A, and also of every principal minor obtained by 
annexing two rows and the same two columns. 

Without loss of generality we may assume that the elements of 
Pr lie in the first r rows and first r columns of A = {an). Let gn 
denote the determinant obtained from Pr by annexing the ith 
row and jth. column of A. Our theorem wiU follow from Theorem 
14 if we prove that Qij = 0 for all distinct values of i and f. Let 
c denote the determinant obtained from Pr by annexing the ith 
and jth rows and the -fth and jth columns of A . Let A t denote the 
cofactor of an in c. By a standard theorem on determinants, 

(24) cpr= 4” f*' ^ g,jgu - 

SX jx J 7 

By hypothesis, gn, g^j and c are zero. Since A' = A, if we inter- 
change the rows and columns of g^i, we get ptf. Hence 

0 = g%j g ]% = g%j Qi]] Qxi ^ 

Theorem 16. If all (r + 1 )“Towed principal minors and all 
(r + 2)-rowed principal minors of a symmetric or Hermitian matrix 
A are zero, the rank of A is r or less. 
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This is true when r = 0, since a^^ = = 0 and a^^ a^j — a^,• 

= 0 imply that aij == 0 and hence that every element a^j 
is zero, whence A is of rank zero. 

Proceeding by induction on r, we assume that the theorem is 
true when r = fc, that is, if all (& + l)-rowed principal minors and 
all (k + 2)-rowed principal minors are zero, the rank of A is ^ k. 
To prove the theorem when r = fc + 1, let all (k + 2)-rowed 
principal minors and aU (k + 3)-rowed principal minors be zero. 
Then if all (A; + 1 )“rowed principal minors are zero, the rank is 
g fc by the assumption just made. But if some (k + l)-rowed 
principal minor is not zero, the rank is fc + 1 by Theorem 15. In 
either case, the rank is g fc + 1, and the induction is complete. 

Theorem 17. Any symmetric or Hermitian matrix of rank r 
(r > 0) contains an r-rowed principal minor which is not zero} 

By the definition of rank r, all (r + 1 )-rowed principal minors 
are zero. If all r-rowed principal minors were zero, the rank would 
be g r — 1 by Theorem 16. 


Exercises 

1. By means of Theorem 16 prove that 2 is the rank of 


- 6 

- 5 

1 

- 2' 

-5 

56 

4 

11 

1 

4 

0 

1 

- 2 

11 

1 

2^ 


2. CaU a matrix A skew-symmetric if A' — — A Its determinant is zero 
if of odd order. Extend Theorems 15-17 to skew-S 3 nnmetric matrices. Using 
Theorem 17, prove that the rank of a skew-symmetric matrix is always even. 

3 If A is a real skew-symmetric matrix, P = -iA is Hermitian. Hence give 
a more direct proof of Ex. 2 The characteristic equation [A — XJ j = 0 of any 
real skew-symmetric matrix has only purely imaginary roots. 

^ For elementary, direct proofs, see Dickson, Wedderbum, and Bliss, Annals of 
Math., (2), 15, 1913, 27, 29; 16, 1914, 43. 
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4 By an alternate bilinear form is meant 


n 

a = S akjXkVj, 
A: . / -l 


afk = — ct&3. 


Show that, if ai 2 9 ^ 0, 

n 

Xt = ai2"^(X2 — 2 tti, X,), = Xi (i = 1, 3, 4, . . . , ti) 

J “3 

and the cogredient transformation on the j/’s replace a by Xi Fg — Fi X 2 -f 4 >, 
where ^ is an alternate bilinear form in X^, F^(^ = 2, . . . , ?^). Treating 4 > as 
we did cKj prove that any alternate bilinear form a with coefficients in any 
field F is equivalent under cogredient non-singnlar transformations with 
coefficients in F to 

172 ~ I2 ^1 + bVi I4 173 4 * •••-{- | 25-1 '928 — I28 1728-1. 

5. By an alternate Hermitian bilinear form is meant a form a in Ex. 4 in 
winch now a,* ~ If its matrix is A, show that B = iA is a Hermitian 

bilinear form. What is therefore the canonical form of a? 


Theorem 18. If M is a symmetric or Hermitian matrix of rank 
r {r > 0 ), we may derive a regular symmetric or Hermitian matrix 
A = (Oif) by a 'permutation of the rows of M and the same permuta- 
tion of the corresponding columns such that no consecutive two of the 
numbers 


(25) po == 1, Pi = ail, P2 = 


an 

0521 


ai2 

0522 


• J Vr 


an » • * air 

Ufi • • • Ofr 


are zero, where the pi are principal minors and pr 7 ^ 0 . 

Postponing the case in which all the elements of the diagonal of 
M are zero, let the tth element be not zero. After interchanging the 
first and ith. rows and first and ^th columns of M, we obtain a sym- 
metric or Hermitian matrix Mi having an 9 ^ 0. 

Postponing the case in which zero is the value of every two- 
rowed principal minor, containing an, of Mi, letj be such that the 
two-rowed minor obtained by deleting all rows and columns 
except those numbered 1 and j is not zero. After interchanging the 
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second and jth rows and second and 7th columns of Mi, we obtain 
a symmetric or Hermitian matrix M2 having an ^ 0, p2 = 

^2 — ^12 (^21 9^ 0 . 

Consider those three-rowed principal minors of M2 which have 
P2 as a first minor and proceed as before. 

In this manner we will ultimately obtain a symmetric or Her- 
mitian matrix Mr in which no one of the numbers ( 25 ) is zero 
unless we meet one of the postponed cases. The general such case 
is as follows. Let each of the numbers pi = an, p2, - . - , p* be 
not zero, where k < r, while zero is the value of every (fc + 1)- 
rowed principal minor having pjt as a first minor. By Theorem 15 , 
not every (k + 2)-rowed principal minor having p;^ as a second 
minor is zero, since otherwise the rank of M would be A; < r. 
After a permutation of the rows numbered k + 1, k + 2, ,n 
and the same permutation of the corresponding columns, we have 

Pa+ 2 0, Pft+i == 0. 

To prove the theorem by induction, assume that we have 
reached a matrix B in which no consecutive two of the numbers 
pi, . . . , p, are zero, while p* 5*^ 0, s < r. If not all (a + 1 )-rowed 
principal minors (of B) having p® as a first minor are zero, we 
permute rows and also columns numbered s + 1, . . . , ^ and 
obtain a matrix with also 0 . In the contrary case, we pro- 

ceed as in the preceding paragraph with k = s and obtain a matrix 
with also Ps+2 9^ 0, p^+i = 0. 

39. Kronecker’s^ method of reduction. 

Theobem 19 . Consider a quadratic or Hermitian form 

n 

( 20 ) Q = Q — ^ aijXiXj'p aji — Uxjj 

i, 

of rank r (0 < r < n) with coefficients in the field F» After a per- 
mutation of the x's we may assume from Theorem 17 that 1 a* / ] 7*^0 

^Werke, I, 166, 357. Cf. Dickson, Trans. Amer. Math. Soc., 10, 1909, 129, who 
gave applications to modular invariants. 
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{i, j = 1, ... 5 r). Then we may express the x^s in terms of new 
variables y^s by a linear transformation with coefficients in F of 
determinant unity such that Xi = yi ~ r + 1, . . . , n) and such 
that Q becomes 

r 

Qi = 2 Vi Vh 

i, 2~l 

whose terms have the same coefficienis as the corresponding terms of Q. 
According as x denotes x or x, write i = 2 or 1 ; then 

— ^t2^ aij Xi (^ = 1 , . . . , n), 

OXi 

Since the determinant of these n linear forms is of rank r, Theorems 
10 and 13 of Ch. Ill show that (the first) r of them are Unearly inde- 
pendent, while the remaining n — r are linearly dependent on those r: 


(27) 


(k = r+l,...,n)f 


where the are numbers of F. Consider the transformation 

n 

Xi = Vi- Cki Vk (f = 1, . . . , r), 

(28) *"’■+* 

Xk = Vk Qc = r + l,...,7i) 
of determinant unity and the implied relations 

n 

(29) Xi = 2^4*“ Ck%yk ^ "Lj • • m y r)j 


Xjs — yjt (Jo — T “f” • • • 9 


Hence 


— = 0 

dyk T^idXi dyk dXk 


(k = r+l,...,n). 
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and, since Q — Q, 

^ ^ = V — = 0 

Spa dpk dXi dpk dxk 

{k = r + 1, . . ,n), 

by (27) and the relation obtained from it by placing the symbol ~ 
over every number of it. Thus Q is transformed by (28 ) and (29 ) 
into a form Qi lacking yk and pk (k = r + 1, , n). Hence Qi 
is derived from Q by writing a;» = yu Xi = pi (i — 1, . . . , r), 
Xi = jgj, = 0 (i = r 4- 1, • • . , so that Qi is the form given in 
the theorem. 

In the following two theorems, Aij denotes the cofactor of flt/ 
in the determinant a of the form (26). 

Theorem 20. If 0, we may express the x’s in terms of new 

variables yi, ... ,yn^ o, linear transformation with coefficients in F 
of determinant unity such that Xn = yn and such that (26) becomes 

(30 ) ] y iV i A Vn Vn* 

i, /=! 

In proof, consider the form 

^ .. a ^ 

<f> — / j (L%j Xi Xj . Xn Xn* 
i, j=l 

Its determinant is derived from a by replacing a„„ by — a/ Ann 
and hence is equal to the sum of a and a determinant obtained from 
a by replacing the last row by 0, . . . , 0, — afAnn- Hence the 
determinant of is 

CL Amn * a/ Ann “ 0- 

Thus is of rank » — 1. Theorem 19 therefore shows the existence 
of a transformation of the t 3 T)e specified in Theorem 20 which 
replaces <j> by 
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n— 1 

dtj y%y j 

J“1 

and Xn Xn by y^ yn, and hence replaces (26) by (30). 

Theorem: 21. If Ann - 0, An-i n-i = 0, An n-^i 9^ 0, we may 
express the x^s in terms of new variables yi, ynhy a linear trans-- 
formation with coefficients in F of determinant unity such that 
Xn-i = 2/n-i; Xn = Pn, omd such that (26) becomes 

7t-~2 

(31) Y!, Vi + gvn-i Vn + gvn Vn-i, g= 

i, 3=3 

Let b denote the determinant obtained by deleting the last two 
rows and last two columns of a. Then, as in (24), 

(32) ab= ^"-1" = - Ann-lAnn-l9i0. 

-^nn— 1 -^nn 

Consider the form 

n 

^ ^ yXn—l Xn QXn 1- 

if 3*X 

Its matrix is 

^11 • ' * ai rt-l Clin 

M- • 

1 1 * * * C^rj.— 1 n— 1 ^71—1 n Q 

anl * * * an n—1 ““ Q Clnn 

The determinant obtained by deleting the last two rows and last 
two columns is 6, which is not zero by (32) and is the pr of The- 
orem 15. Next, the principal minor of M obtained by deleting the 
last row and last column is Ann = 0, while that obtained by delet- 
ing the {n — l)th row and {n — l)th column is An-i n-i = 0 
[they are the only (n — l)-rowed principal minors having 6 as a 
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first minor]. Finally, if d is the determinant of M, the theorem 
which gave (32) implies 

77 ji_l An— 1 n *4” xT 

do — 1 I I, >4 

An n-1 + Ann 

where f = An + firb, since b = b. Replacing g by its value in 
(31) and using (32), we get 

An n—l ^ ~ b(^gAn n—1 u) 0, t 0. 

Hence db = 0. Thus d = 0. Hence M is of rank n — 2 by 
Theorem 15. Hence there exists by Theorem 19 a transformation 
of the type specified in Theorem 21 which replaces by 

71—2 

'EaHViVi, 

£, 7=1 

and therefore replaces (26) by (31). 

If we apply to (31) the transformation 

yi == Xi(i ” 1 , . . . , 2 ), yn—l “ ^n—1 

yn ~ (X /9 )^n— 1 4" 
of determinant unity, we obtain 

n-2 ^ 

(33) J2aijXiXj + 2Xn-\ Xn^l — \QgXn 5n, g = a! Ann- 1 - 

Let Q be a form (26) of rank r with coeflSicients in a field F. 
We can find by means of Theorems 19-21 a linear transformation 
of determinant unity with coefficients in F which replaces Q by 

(34) giXiXi+ V grXr Xr {gi = gi^ 0), 

in which the g^s are obtained explicitly in terms of the coefficients 
an of Q by formulas which differ in the various subcases neces- 
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sary to consider. We first pass at once to the equivalent form Qi 
of Theorem 19. 

(i) If not all (r — l)-rowed principal minors of the determinant 
a of Qi are zero, we may permute the x's and assume that the co- 
factor Arr of Urr iu u is uot zovo. Wc obtaiu the equivalent form 
(30) of Theorem 20 with n = r, having the segregated term 

{alArr)XrXr. 

(ii) But if aU (r — l)-rowed principal minors of a are zero, 
the cofactor Aij of at least one element a^J(i 9 ^ j) is not zero, 
since the rank is r. Permuting the rr’s, we may assume that 
At r-i 9 ^ We obtain the equivalent form (33) with n = r 
having the last two terms segregated. 

Treating the unreduced part (under the summation sign) as in 
(i) or (ii), we finally obtain (34) in which the coefficients gi are ex- 
pressed in terms of determinants of the matrix of the given form Q. 

For example, if the first of the two alternatives (i) and (ii) 
holds true at each stage of the reduction, we obtain the equivalent 
form (34) with the coefficients 


(35) 


. . . , Ijl, 

PO Pi P2 Pr-2 Pr-1 


where po = 1 and the remaining p's are the special principal 
minors (25) and each is not zero in the present case. 


40. Number of positive coefficients in the canonical form. By 
Theorem 18, the variables of Q may be so permuted that no con- 
secutive two of the numbers po, Pi, • . . , Pr zero, while pr 9 ^ 0; 
caU such a form regular. 

Let the field F be real when Q is a quadratic form. When Q is an 
Hermitian form, let F be a field obtained from a real field S by the 
adjunction of where r is in S and is negative. Then if a; = a 
+ hv^, where a and h are m S, xx = d? v¥ is positive unless 
X = 0. In either case, pi = p* is real, while the coefficients of the 
last two segregated terms of (33) are of opposite signs. 
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Let P be the number of permanences of sign and V the number 
of variations of sign in the sequence po, Pi, • • • , Pr, with the agree- 
ment that a vanishing pi is counted as positive or negative at 
pleasure, but must be counted. 

Let p be the number of positive coefficients and n the number 
of negative coefficients in the equivalent form (34). 

If each Pt 7 ^ 0, these coefficients are given by (35). For each 
permanence (variation) of sign, two consecutive p’s are of like 
(opposite) signs and their quotient is a positive (negative) coeffi- 
cient (35), and conversely. Hence p = P, n — V. 

This result holds true also when any p, is zero. For, pi_i and 
P .-+1 are then each 0 (since Q is regular) and have opposite 
signs by (32) with n = i •+• 1, a = Pa+i, h = p»_i, Ann = p.- By 
the final two terms of (33), we now have two coefficients of oppo- 
site si gns instead of the two fractions (35) which involve p,-. We 
have P = F = 1 for the sequence pi-i, Pi = 0, p,+i whose 
signs are -h + — or — ± -i- by the agreement concerning zero. 
In other words, by ignoring the permanences and variations of 
sign in this sequence of three terms, we reduce P, F, p, and n each 
by 1. After aU such deletions due to vanishing p,’s, we saw that 
p = P, n = V. Hence the latter hold also before the deletions. 

Theorem 22. Within any realjield, any quadratic form of rank r 
is equivalent to one of type gix^ A QrXf'. Within a field 

obtained from any realjield S by the adjunction of the square root of a 
negative number of S, any Hermitian form of rank r is equivalent to 

one of type gi a;i ici -h V grXr Xt, with each gi in 8. Then the 

number of positive g’s is the number of permanences of sign in the 
sequence po, pi, . . . , pr, no consecutive two of which are zero, as may 
be assumed after a suitable permviation of the x’s. 

For example, let Q = sF + 2*2 Xs -|- ZtK Then po = 1, pi = 1, P 2 = 0, 
333 = — 1 , P 4 = — 1, P = 3, F = 1. Since Q is equivalent to + 2 ( 

+ Xi% vie have p = 3, w = 1. 
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THEORY OF LINEAR TRANSFORMATIONS, 
INVARIANT FACTORS AND ELEIMENTARY DIVISORS 

Weierstrass^s elementary divisors and the related invariant 
factors are here introduced in a natural manner in connection with 
the classic and a new rational canonical form of linear transforma- 
tions, respectively. 

41. Rational canonical form of a linear transformation.^ Let 
Wi, ... ,Wn be independent variables. Let S be any linear trans- 
formation 


S: Wi=^a^J'Wi (^ = l, ...,n) 

with coefficients in any field F. We do not exclude the case in 
which the determinant of S is zero, so that the matrix of the coeffi- 
cients of S is perfectly general. 

Let Xi be any homogeneous linear function of toi, . . . , Wn whose 
coefficients belong to F and are not all zero. Let Xi denote the 
corresponding function of TVi, . . . , Wn- If Xi = cxi, where c is 
in F, we have (1) for o = 1, with the understanding that the final 
equation alone is retained. But if Xi ^ cxi, we write oih for Xi. 
Let X 2 denote the same function of the Wi that X 2 is of the Wi. 
Then if X 2 is a linear function of xi and X 2 , we have (1) for a = 2; 
in the contrary case we write xz for X 2 and have three linearly inde- 
pendent fimctions Xi, x^^ Xz, Proceeding in this manner, we obtain 


^For the case in which the determinant is, not zero, S. Lattes, Annales Fac. Sc. 
Toulouse, (3), 6, 1914, 1-84. Also W. KruU, Uber Begleitmatrizen und Elementar- 
teilertheorie, Freiburg thesis, 1921. In Leipzig. Berichte, 69, 1917, 325-35, G. 
KowalewsM employed points, instead of linear functions, and obtained a canonical 
form whose matrix is the transpose of that in the text. The text presents the theory 
as found independently by the author several years ago. 

89 
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a chain of linearly independent functions Xij X 2 j . . . ^ Xa with 
coefficients in F such that S implies 

(l ) Ax ~~ X^f -A2 XSj • • * j Xa—l Xaj -A<j \Xlj . . . , 

where the bracket denotes a homogeneous Knear function of 
Xiy , . . j Xa with coefficients in F. We call a the length of the 
chain, and Xi its leader. 

Theorem: 1. By the introduction of new variables which are 
linearly independent homogeneous linear functions of the initial 
variables Wx with coefficients in the field F, any linear transformation 
S with coefficients in F may be reduced to a canonical form defined by 
(1) and 


(2 ) Yi y 2 j Y2 yz) . . . , F 5—1 y Yh [^1, . . . > ^ 5] 5 

( 3 ) Zi = Z2, F2 = ^35 • • • 7 Zc^i = ^cj Zc == [zi ) ... 7 Zcl; 

etc., where a is the maximal length of all possible chains, h is the 
maximal length of a chain whose leader is linearly independent of 
xi, . . . , Xa, and c is the maximal length of a chain whose leader is 
linearly independent of Xi, ,,, , Xa, yi, • . • , y^* 

Ifa = l, SisTFi = auWi (i = 1, . . . , n), and is in canonical 
form with n chains. Henceforth let a > 1 and select a function 
Xi which is the leader of a chain of maximal length a, so that we 
have (1). li n = a, the theorem is proved. 

For n > let Ui be any homogeneous linear function of the 
initial variables Wi with coefficients in F which is linearly inde- 
pendent oi Xi, , , , , Xa^ Let Z7i denote the corresponding function 
of the Wi, If Ui= [xi, — , Xa, Ui], we have (4) for 5 = 1. In 
the contrary case, write % for Ui. Then if U 2 == [xi, , . , , Xa, ui, Uz], 
we have (4) for 6 = 2. In the contrary case, write Uz for U 2 * 
Proceeding in this manner, we get 

(4) Ui — u%, U 2 — Uz, , . . , = Uft, Ub I + [ui , . . . , Uh]f 
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where Z = [xi, . , . , Xa], and Xi, . . , , Xa, Ui, . . . jUb are linearly 
independent. If i = 0, (4) is the desired result (2) with yi = u^. 

Next, let 1^0. Then I + , w&] is linearly independent 

of uij ... jUhj so that Ui is the leader of a chain whose length ex- 
ceeds b. Hence b < a. Introduce the new variables 

yi = ui+fy y 2 = U 2 + fh • • • , yb ub + /&-i, 

where / is a homogeneous linear function of xi, ... ,Xa with 
coefficients in F, and S replaces / by /i, /i by / 2 , etc. We can choose ^ 
/ so that (2) follows. 

To obtain the third chain, let vi be any homogeneous linear func- 
tion of the initial variables with coefficients in F such that 
Xi, . . . yXa,yu . • • * f y-bi vi are linearly independent. Let Fi denote 
the corresponding function of the W i. If Fi = [xi, . . . , Xo, 
2 / 1 , , Vhy ^i], we have (5) for c = 1. In the contrary case, write 
v% for Fi. Then if F 2 = [xi, . . . , Xa, 2 / 1 , ... , 2/&; Vij we have 
(5) for c = 2; in the contrary case, write vz for F 2 . Proceeding in 
this manner, we get 

(5) Fi = Vz, Vz^ Vz,. . Fc~i = Vc, Fc = Z + m, 
where 

I = [xi , . . . , ra, 2/ij • • ■ j yb], m = Si + hScVc. 

Here xi, . . . , Xa, 2 / 1 , , yhj vi, . . . ,Vc are linearly independent, 
and the Si are numbers of F. If Z = 0, we have the desired result 
(3) with Zi == Vi. 

Next, let Z 0. Then Z + m is linearly independent of Vi, . . ,Vc, 
so that vi is the leader of a chain whose length exceeds c. Since Vi 
is linearly independent of , Xa and since b is the maximum 

length of a chain whose leader is linearly independent of xi, . . ,Xa, 
we have c <b. Introduce the new variables 

^The proof is like that concerning (6) with the y% suppressed andsi replaced by 
yx, c by and, beginning with (8), also h by a. We prefer to omit the details here 
and treat the more typical derivation of the third chain. 
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(6) == +/^ ^2 = ^2 + /l? * . . j +/c-l; 

where / = /o is a homogeneous linear function of Xi, ,,x ay 
Vi, ‘ • jVh with coefficients in F, while S replaces / by fi, fi by / 2 , 
etc. Let Zj denote the same function of the W^ that is of the Wz. 
Hence 

c 

(7) Zi = z^y Z 2 = Zzj . , , , Z c^i = Zcj Z c = "hX!) 

where 

c 

^ + /c 

z^l 

We can choose / so that h becomes a homogeneous linear func- 
tion of x^J yi{i = 1, . , . , c). The proof is simpler by successive 
steps. First, if is the coefficient of Xa in Ij take / == — kxa^cy 
whence fc == — kxay and h lacks Xa^ Since (7) is of the form (5), 
we may therefore assume that I itseK lacks Xa- Then if i is the 
coefficient of Xa^i in I, take / == — tXa^c^-h whence /c ~ 
and h lacks both Xa^i and Xa^ The last of these steps employs 
f — — gxi to delete Xc+i from Zi. Similarly, we can choose the 
coefficients of 2 / 6-0 Vb^c-i, • • • ? 2/i of / to delete 2 / 6 , Vb-iy * y 
2/c+i from li. 

If the resulting function h is identically zero, (7) is the desired 
result (3). In the contrary case, write 

1% == tiXi "h ' • • + Zc + W^l 2/i + • • • + '^c Uc+i-l 

(Z = 1, , . . , 6 — c + 1). 

Since we proved that h is given by this formula with i = 1, the 
fs and m’s are not all zero. Let Li denote the corresponding func- 
tion of the Z's and F^s. Hence 


( 8 ) 


Li = Zi+1 


(f = 1, — jh — c). 
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Evidently = — c + l) are linearly independent 

linear functions of x„ y,{j = 1 , . . . , 6 ). By ( 7 ) and ( 8 ), the 
chain with the leader Si has the successive terms 

C 

^ 1 } • • • 3 ~ ^3 H“ (j — 1 , . , , , 6 — C -f- 1 ), 

and possibly further terms. By ( 6 ), the & + 1 terms z are linearly 
independent. Since the leader Zi is linearly independent of , 

Xa, this result contradicts the definition of b. 

42. Theorem 2. In a canonical transformation the characteristic 
determinant of the partial transformation on any chain is divisible by 
that of the next chain. 

For definiteness let the chains be the second and the third, and 
let the partial transformations on them be 

(9) Yi = ^2, F 2 = ^3, . . . , F 6~i — Vb, Ffe = Vi] 

(10) Zi = Z2j Z 2 — Zz, • • • , Zc-1 = Zcj Zc = ^ Ci Zi. 

i-1 

Then c ^b. Consider also the linear functions 

(1^) ^c+l = Fc, Zc +2 = Zc+l) • • • 3 ^b+l ~ Zbj 

which may be computed in terms of zi, , , , , hy the recursion 
formula 

c 

(12) ^C+3 ~ ^i+3-1 0* ~ — C -f" 1), 

The chain with the leader = ^/i + zi has the successive terms 
Wj = yj + Zj{j = 1 , . . . , 6 ), which are linearly independent since 
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2 / 1 , , 2/6, Zi,...,Zc are independent. For, if ^ fc,( 2 /, + z,) 
= 0, then '^k}yj+ Z = 0, where Z is a hnear function of 
Zi, ... ,Zc, whence each kj = 0, Z = 0. Since i is the maximal 
length of a chain whose leader is linearly independent of xi, . . . , a; a, 
the chain led by Wi is of length g b, and hence is of length b. 
Write TF,- = 7,- + Zj. Then 

6 

1^6 = b, 2/» + 26+1 = [u’l, • ■ . , W6] 

i=L 

b b 

= b. Wi = 22 b.(2/i + Zi), 

i=«l 

whence 

6 

(13) 25+l=22bi2i. 

i^l 

lie = bf then Zb+i = ^ Ci Zi by (10) or (12), whence Ci — bi, so 
that (9) and (10) have the same characteristic determinant. 
Next, let c <b. If from the difference of the two members of 

(13) we eliminate 0 ?,, . . . , Zc+i in turn by means of (12), we 
obtain a linear function of . . . , whose coefficients must aU 
be zero. By a mere change of notation, we conclude that, if we 
eliminate X^, . . . , X^"^^ in turn from 

6 

(14) X^+i 
by means of 

(15) = E c. (,• = 1, . . . , 6 _ c + 1), 

i=l 

we obtain a linear function of X, . . . , X' whose eoeflB,cients are all 
zero. These eliminations are evidently the successive steps in the 
ordinary process of division of (14) by 
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(16) X'+i - 23 c. X‘. 

Hence (14) is exactly divisible by (16). Removing the factor X 
from each, we obtain the characteristic determinants of (9) and 
(10) respectively, signs apart. For, if iV' is the matrix of (9), its 
X-matrix — Xl is 

-X 1 0 0 ••• 0 

0 - X 1 0 • ••0 


( hi 62 hz 64 hb — Xj 

whose determinant, called the characteristic determinant of N, is 

( 18 ) (- 1 )HX^ ~ 61 - 52 X ~ &3 X 2 55 X^-^). 

Example 1. Let S be TFi = awi + hw2} W2 = wi, W3 = hw2 + awz. 
Employing the new variables xi = ^2, = Wi, y ^ loi, Xi = W2, etc., 

we get 

C i Xi = x%, X2 — bxi -f" Y ~ ay. 

Apparently we have a canonical form composed of a chain Xi, x% and a chain 
y. But the characteristic determmant — aX — b of the partial transforma- 
tion on iCi and is divisible by the characteristic determinant a — X of 
Y = ay only when & = 0. If 5 = 0, C is the true canonical form of 8, since 
every linear function of Xi^ xz, y, in which the coefficient of xi is not zero, is the 
leader of a chain of length 2; while, if that coefficient is zero, it is the leader of 
a chain of length 1. If & 7^ 0, take 

Zi = Xi-\-yy = Zi + F = X 2 + 02/ = 

Z2 = X2 "b OtY 

The determinant of the coefficients of rci, xa, y is 
when 6 5^ 0 is therefore 

Zi — Z2, Z2 == 23, Z3 = ahzi + (5 — a^)z% + 2 azz. 

Example 2. There is a single chain, say with the leader Xi = Wij in the 
canonical form of 

TTi = w72 + Wzi Wt = wx-\-v}Zi W z = wi W 2 Wz, 


= 6x1 -f + a® 2/ ~ ^ 3 * 

— b. A true canonical form 
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43. Theorem 3. If di, , dk are the characteristic determinants 

of the partial transformations on the variables of the first, , , , , kth 
chains of a canonical linear transformation whose \--matrix M has 
n rows, and if denotes the greatest common divisor of all {n — i)- 
rowed determinants of M, then 

d% * • • dk, 02 ^ ds • • • dk} - • - j 0 k— 1 ~ dk} 0j ^ 1 (i S k). 


We employ the condensed notation 


Ai 0 0 • • • 0 0 
0 A.O 0 • • • 0 0 


0 0 0 0 Ak 


where Ai, . . . , A ^ are the X-matrices of the partial transformations 
on the variables of the first, . . . , ^;th chains of lengths a, e. 

For example, A 2 is given by (17). Their determinants are 
di, . . . , d*. Let aij, b ^, ... be the elements of Ai, A 2 , . . . in the 
ith row and jih column. 

The minor ilf' of an element in one of the 0 matrices of M is 
zero. For example, let Af ' be obtained by deleting a column con- 
taining elements of Ai and a row not containing elements of Ai. 
Evidently every a-rowed minor formed from the first a rows of Af ' 
has a column of zero elements. Hence Laplace’s development of 
Af' according to the first a rows gives M' = 0. 

The minor of hi in (17) is 1. Similarly, the minor of aai in Ai is 
1. Hence the minor of Uai in Af is = cfe • * • dk, while the minor 
of any aij is evidently a multiple of 0 i, The minor of any bij is a 
multiple of didzdjk * dk and hence of gi, since di is a multiple of 
d 2 by Theorem 2. The minor of any Ci/ is a multiple of di d 2 d 4 
• • • d* and hence of ^ 1 . In this manner we conclude that 0 i is the 
greatest common divisor of aU {n — l)-rowed minors of Af. 

Any (n — 2)-rowed minor is zero if it is obtained from Af by 
deleting two rows and two columns crossing in elements of a zero 
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matrix ; the proof is the same as for M' above. The minor obtained 
by deleting from M the row and column crossing in Gai as well as 
those crossing in hhi is evidently — dz • • • while the minor 
obtained by deleting the row and column crossing in any Gi/ as 
well as those crossing in any hi j is clearly a multiple of grg. The 
minor obtained by deleting from M the row and column crossing 
in any aij as well as those crossing in any Cij is evidently a multiple 
of ^2 ^4 * • • djo and hence of ^ 2 . 

Proceeding in this manner, we have Theorem 3. 

44. Invariant factors of a canonical transformation. By (18), 
we have 

di = { d2 = i 1)^ . . . y 

where In, * • ^ , In^k+i are polynomials in X in which the coefficient 
of the highest power of X is unity. Write 

(19^) In—k = 1, . . . , Ji = 1. 

It is customary to call I j the jth invariant factor of the X-matrix 
of the canonical transformation C. Hence, apart from sign, the 
invariant factors distinct from unity are the characteristic deter- 
minants of the partial transformations, one for each chain of C. 

Theokem 4. If Ij is the jth invariant factor of the nrrowed 
\-matrix M of a canonical transformation , then I j divides I y+l* -(f 
G j is the greatest common divisor of all j-rowed determinants of M 
chosen so that the coefficient of the highest power of \ in Gj is unity, 
and if Go == 1, then 


( 20 ) 

Gj* = Ji J 2 • • • J/ 

11 


( 21 ) 

2,- = Gj/Gj-i 

U = 1 , ■ • 



so that the G’s uniquely determine the Ts, and conversely. 
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The first statement follows from Theorem 2. Next, if we write 
dlj = 1 when j > fc, we see by Theorem 3 that Gn^i = idi+i 
• • • dn for 0 ^ i < n. By (19) and (19'), d,* == ± In- 3+1 for 
1 g j g n. Hence we have (20) and therefore also (21). 

45. Theorem 5. If in a linear transformation with the matrix A, 
we introduce new variables defined by a non-singular linear trans- 
formation with the matrix B, we obtain one with the matrix BAB~'^. 

Using the same notation for a linear transformation and its 
matrix, let 


n 



A: 

Hi — ^ii ^3 

j»l 

n 

11 

(22) 

B: 

bik Xjc 

h^l 

n 

(i = 1, . . . , n), 

(23) 

B: 

Yi^Y^bikXk 

(f = 1, . . . , n). 

Since i 

l&i.h 

0, we may solve 

equations (22), and obtain 


B-h 

n 

Xj = T**! Cjs yt 

11 




Elimination of the Xi and the Xj between (23), A, and gives 

BAB Y i ^ ^ ^ bijs ajcj C] s y 8 ('^”1,...,?2-), 

if « 

46. Rotations and orthogonal transformations. We shall illus- 
trate the introduction of new variables by an important example. 
If wi, W 2 ,\wz are the coordinates of an arbitrary point W referred 
to given:reciangular axes Owi, 0 w 2 , Owz, we seek the coordinates 
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Ph P 2 j pz of the point P derived from W by the rotation about ON 
through angle a counterclockwise when viewed from N toward 0. 
Let Uij Th) ^3 be the direction cosines of ON, Choose new rectan- 
gular axes Ox, Oy, Oz, whose direction cosines with respect to 
Owi, Ow 2 , Owz are h, hj k; mi, m 2 , m 3 ; 7 ^ 2 , ns, respectively. Re- 

ferred to the new axes, let W and P have the coordinates x, y, z and 
Xj Yj Zj respectively. Since the axis of rotation ON is now Oz, we 
have 

(24) Z == rc cos a — 2 / sin a, T = :r sin a + 7 / cos a, Z = z, 
by plane anal 5 diics. By solid analytics, we have 

X = h Wi h W2 I3 Ws, Wi == li X + mi y Til z, 

(25) y == n^lWl + + msWs, Wz = kx + m^y + uzz, 

z ^ niWi -jr nzWz + ns Ws, Ws — kx + msy + ns z. 

Let D be the determinant of the transformation T defined by the 
first triple of equations. The second triple defines the inverse 
transformation T~S whose determinant is derived from D by inter- 
changing rows and columns. Hence DD = 1, P = + 1. After 
reversing if necessary the positive direction of one of the new axes, 
we have P = -h 1. Solving the second triple of equations for z 
and comparing the result with 2 = ni + . . . , we see that 

(26) ? 2 m 3 — Z 3 m 2 = Ui, lims — hmi = — n^, Zi m 2 — Z 2 mi = ns. 


We must express transformation (24) in the new variables 
Wi, wz, Ws defined by (25) and 

Pi = Zi Z + mi Y UiZ (i = 1, 2, 3). 

In the latter insert the expressions (24) and then the values of 
X, y, z from (25). We get 


3 

P» == X) iO'i h' + mim, Ocos a + {l^mi — Umj) sin a + niTijlwj, 

/-I 
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By the properties of direction cosines of perpendicular Hues, the 
coefficient of cos a is equal to — or 1 — according as 
j ^ ioTj = i. The coefficient of sin a is one of the values (26) oi 
zero in the corresponding cases. Write 

(27) d = cos a» = n^ sin {i = 1, 2, 3). 

Then cos a: = 2 ^^ — 1 . Thus the coefficient of Wi in pi is 


(1 — n^^) cos a + n^^ = 2d^ — 1 + 2{ni sin |a)^. 


Next, for j 9 ^ the coefficient of -u?, is 

— cos a) + 6,4 sin O' = 2n^n^wi^ ia 

+ 2eji Uh sin cos fa, 


where k denotes that one of 1, 2, 3 which is distinct from i and jj 
while 

C12 = C23 = ^31 = + 1 ) Ci 3 = 621 = 632 = — 1 . 

Inserting the values from (27), we get 

1 , 2, 3 

Pi = {2d^ — 1 + 2 a»^) Wi + [2ai a ? + 2eji ak d] w^y 


or, written at length, 

pi= (2aF+2(f2~ 1 )wi+2 {aia^~azd)w^+2 {aiaz+aid)wz^ 

(28 ) P 2 = 2 {aiCh-\-azd)wi-{- ( 2 a 2 ^+ 2 (f^ — 1 )w%-{-2 (chCLz *— did )wz} 
pz==2(aiaz—(hd)wi+2 {a2az+aid)w2+ (2az^+2d^—l )wz. 

Theorem 6. Formula (28) represents the rotation about the axis 
ON with the direction cosines ui, n^y nz through angle a counterclock- 
wise when viewed from N toward 0. 
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The values of d, ai, < 22 , az are given by (27), whence 4- 
■4“ CL^ 4“ = 1* 

Consider a real ternary orthogonal transformation 

T: yi = h^iXi + h^a>i + hizXz (i = l, 2, 3), 

whose matrix B is of determinant 4~ 1. Thus B'B = L If D is 
the determinant of 5 — Z and hence of 5' — I, then 

B'iB-I) = - (B’ -I), \B'\D=-D, 2D = 0. 

The point N = (ni, 712 , ns) whl be unaltered by T if 

n* = bii Ui 4" bi2 712 4“ bis (i = 1^ 2, 3), 

which may be written as homogeneous equations having the deter- 
minant D = 0, Hence there exist real solutions ni, 712 , tiz, not aU 
zero. Let T replace the point X = (xi, Xzj Xz) by F = (^ 1 , 2 / 2 ? yz)* 
Then 

yi — n, = bii(xi - ni) 4 “ bQ(x 2 -712) + — ns). 

Hence — ntY = ^(xi — 7iiYj for the same reason that 

^y%^ = ^x^ for the orthogonal T. Hence if 0 is the origin, 
OF = OX, NY = XX. Thus the triangles OXN and OYN are 
congruent. Hence X and F lie on a circle whose plane is perpendic- 
ular to OX, so that T represents a rotation about OX. 

Any orthogonal transformation of determinant — 1 is evidently 
the product of one of determinant 4- 1 by X = x, F = ?/, Z = — z. 
The latter represents a reflexion in the X 2 /-plane. 

Theorem 7. Every real ternary orthogonal transformation repre- 
sents a rotation or a rotation followed by a reflexion^ according as its 
determinant is 4“ 1 or — 1. 

Corollary. Every real ternary orthogonal transformation of 
determinant 4- 1 is represented by formula (28). 
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Cayley^ expressed the coefficients of the ?i-ary orthogonal 
transformation in terms of \n{n — 1 ) parameters; but his for- 
mulas do not include all orthogonal transformations, except as 
limiting cases. See Exs. 3-6 of §54. 

This subject is only a special topic of the theory^ of linear trans- 
formations which leave unaltered a quadratic or bilinear form. 

47, Theorem 8, Let A and B he n-rowed square matrices such 
that the elements of A are polynomials in X, while those of B are inde- 
pendent of X, and \ B \ 9 =^ 0. Then the greatest common divisor D i (X) , 
with leading coefficient unity, of all t-rowed determinants of AB is 
equal to that of A, 

Let dt be the greatest common divisor of ah i-rowed deter- 
minants of A. By Theorem 5 of Ch. Ill, Dt is divisible by dt. 
Since A is the product of AB by the same theorem shows that 
d t is divisible by D t. The two results imply Dt = dt. 

Since a like theorem holds for BA, we deduce the 

CoEOLLAKY. Let A, P, Q he n-rowed square matrices such that 
the elements of A are polynomials in X, while those of P and Q are 
independent of X, and P and Q are non-singular. Then the greatest 
common divisor of all irvowed determinants of PAQ is equal to that 
of A. 

48, In particular, let A be the X-matrix of any linear transforma- 
tion T, the matrix of whose coefficients may also be denoted by 
T, so that A ^ T — XL. By the introduction of new variables 
defined by a linear transformation with the non-singular matrix 
B, T becomes a transformation C, the matrix of whose coefficients 

^Scott’s Theory of Determinants, revised by Mathews, 1904, 197-202; Pascal^s 
Determinants, §§47-60, which cites twenty writers on orthogonal transforma- 
Uons. 

®Encyclop6die des sc. math., t. I, v. 11, 478-520. Cf. Muth, Elementartheiler, 
1899, 160-179. For Hermitian forms, Loewy, Nova Acta Leop. Carol. Akad., 71, 
1898, 379-416; extract in Math. AnnaXen, 50, 1898, 557-76. 
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may also be designated by C. By Theorem 5, C = Write 

D ^ C — }J. Then D = BAB^^, In view of the Corollary, the 
greatest common divisor of aU i^-rowed determinants of D is equal 
to that of A. This result may be stated as follows: 

Theorem 9. Let any Umar transformation T become C by the 
introduction of any new independent variables. Then the greatest 
common divisor Gt of all t-rowed determinants of the Tsrmatrix of T 
is equal to that of C, 

49. Canonical form determined by invariant factors. Let the 
new variables be chosen so that C is a canonical form of T. Since 
the X-matrices of these transformations have the same Gt, they 
have the same 1/ == GjfGj^i which, by (19), uniquely determine 
the characteristic determinants of the partial transformations on 
the variables of the various chains of C, and hence, by (17) and 
(18), uniquely determine the matrix of C, 

Theorem 10. For any linear transformation T, the quotient 
I,* = Gj/Gj^i is a polynomial in X called the jth invariant factor of 
the \-matrix of T, The matrix of the canonical form of T is uniquely 
determined by these invariant factors. 

In other words, any linear transformation has a single canonical 
form apart from the notation for the variables. 

50. Consider the transformation T defined by (1), (2), (3), 
etc., such that the characteristic determinant of (1) is divisible by 
that of (2), while the latter is divisible by that of (3), etc. Then 
T is its own canonical form, so that a,b,c,.,, have the properties 
stated at the end of Theorem 1. Hence we have 

Theorem 11. There exists a linear transformation with coeffi’^ 
dents in any field F whose X-matrix has any prescribed invariant 
factors I f with coeffidents in F such that I ,• divides I for every j. 
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Si. Similar transformations. Two linear transformations (or 
matrices) S and T m F (i.e., with coefficients in F) are called 
similar in F if there exists a non-singnlar transformation R in F 
such that = T, In other words, S becomes T by the intro- 

duction of new variables defined by transformation B. Theorem 9 
states that the X-matrices of two similar transformations have the 
same Gf 

Conversely, if the X-matrices of S and T have the same Gt, 
then S and T are similar in F. Let TJ = and V = RTR^^ 

be the canonical forms of S and T, Since the canonical matrices 
U and V have the same Gt, they are identical by Theorem 10. 
Hence 

QSQ-i^V, R-WR^T, (R-^Q)S{R-^Q)-^ = T, 
so that S and T are similar. 

Theorem 12. Two linear transformations {or matrices) in F are 
similar in F if and only if their \-matrices have the same invariant 
factors. 


52. Invariant factors. Let M and N be any two n-rowed square 
matrices of which N is non-singular. Write A = MN-\ Then 
M — XN = (A — \I)N has the same (?,• as A— XJ by Theorem 8. 
For the latter, 1/ = Gj/Gj^iis a polynomial in Xby Theorem 10, 
and hence may be defined as the jth invariant factor of ikf -- XN. 

Removing the restriction that N be non-singular, let G, denote 
the greatest common divisor of all j-rowed determinants of pM 
+ crNj where p and <t are independent variables. Since each such 
determinant may be expanded according to the elements of a row 
(or column) and hence is a linear combination of certain (j — 1)- 
rowed determinants of pM -f aN, Gs^i is a factor of every j-rowed 
determinant and hence of Gj, The quotient (?,/(? ;_i is therefore a 
homogeneous polynomial in p and <r, and is called the jth invariant 
factor of pM + aN, 
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Exercises 

1. Xi = X 2 j X 2 = ax 2 f Xs — axs has the invariant factors X(X — a) and 
X — a. 

2. The invariant factors of 

0 0 2 ~ X 

0 1 + X 2X 

2 - X 0 0 

are (X — ,2)(1 4- X) and X — 2. 

53. Classic canonical form.^ For certain purposes a canonical 
form involving irrationalities is preferable to that in Theorem 1. 

Theorem 13. Start with the canonical form in Theorem 1, with 
coefficients in afield F, composed of partial transformations of the type 

a 

(29) Xx ~ X 2 tTsj . . . j X a—i ~ ^o) Xa ^ a^ Xt» 

i=l 

Let the characteristic equation A(X) = 0 0 / (29) have the roots 
R, S, T, . , * , of multiplicities r, s, , respectively. By the intro- 
duction of new variables Ui^VifWi, . . . , which are linearly independ- 
ent homogeneous linear functions of Xi, , Xay (29) can he reduced 
to the classic canonical form 

( TJi = Rui, JJj = Ruj + 0* == 2, . . . , r), 

Vx = Svx, 7,- = Sv,^ + Vi^x 0* = 2, , . . , 5 ), 

TFi = Twij W/ = Twj + Wi^x 6’ = 2, . . . , — • 

Here the Ui {or Vi, wi) are homogeneous linear functions of the x’s 
whose coefficients are polynomials in R {or Sj T) with coefficients in 
F, If R and S are roots of the same equation irreducible in FjVi may 
he derived from Ui hy replacing R by S, 

1 Jordan, Traits des Substitutions, 1870, 114r-26, for transformations with integral 
coefficients taken modulo p of determinant prime to p. For any field, Dickson, 
Amer. Joiar. Math., 24, 1902, 101-8. 
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Before giving a proof leading to explicit formulas for the 
. . . , which imply (30) and their further properties, we shah verify 
a fortiori that (29) is similar to (30), when F is the field of all 
complex numbers, and hence can be reduced to (30) by the intro- 
duction of new variables , formulas for which are, 

however, not found by this method. 

The X-matrix of the transformation in the first line of (30) is 


R - \ 0 0---00 0 

1 - X 0 ••• 0 0 0 


[ 0 0 0*--01i^~X 


Let A 2 and Az be the X-matrices of the transformations in the 
second and third lines of (30). Thus the X-matrix of (30) is 


M = 


Ai 0 0 0 

0 A 2 0 0 ••• 

0 0 As 0 ••• 


The determinant of Ai is di = (J? — X)% while the minor of the 
last element of the fiirst row of Ai is unity. The determinants of 
A 2 , A 3 , . . . are (k = (S — dz = (T — X)% . . . . Then the 
determinant of M is A = da ^3 . . . , while the first minors of M 
include do ds d 4 . . . , di ds d 4 . . . , , etc. Their greatest 

common divisor is 1, since it has none of the factors E — X, 
/S — X, . . . . Again, the greatest common divisor of the first minors 
of the X-matrix of (29) is 1, since the minor of hi in (17) is 1. 
Since the X-matrices of (29) and (30) therefore have the same 
invariant factors 1, . . . , 1, +A, they are similar by Theorem 12. 
In other words, there exist new variables Ui,Vi, . , , whose intro- 
duction reduces (29) to (30). 

The properties of the Ut, Vi, . , . stated at the end of Theorem 13 
are important in various applications, but are not used in the 
present text. Some readers may prefer therefore to ondt the fol- 
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lowing proof of those properties in connection with a direct deduc- 
tion of (30). 

We again start with (29), in which the a, belong to the field F. 
As in (18), the characteristic determinant of (29) is the product of 
(— 1)® by Co(X), where 

a — k— l 

(31) Ca-h{\) = — X) (ifc = 0, 1, - . . , a — 1). 

1=0 

Write Co (X) = 1 as the definition of (31) when fc == a. Then (29) 
replaces the function 

/(X) =i:a:,Ca-.(X) 

by 

a 

Q(X) =S a;, Qi -h aiXi, 


Qj = Ca_,-+i(X) 4- a,- = X®-»+i — X‘ -f a, = XCa_,-(X). 

i“0 

Hence 

(32) Q(X) - X/(X) = Xi[ai - XCa«i(X)] s - xiCaM, 

where the last reduction follows from the preceding result for 
j = 1. By t differentiations with respect to X, we get 

(33) (X) - (X) - (X) = cci (X). 

Let B be a root ofCo(X) = 0of multiplicity r, and write ui for 
/(i?), and Ut+i for the quotient^ of (R) by tl Since the trans- 

^ The division is exact for each coeflBLcient. The entire proof is valid also when 
F is a so-called modular field in which the result of adding any element to itself p 
times is the element zero. An example is the field of the p residues of integers 
modulo p. 
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formation (29) replaces /(X) by Q(X) and hence replaces /^^^(X) 
by (X) for every X, it replaces UtJ^i by (R) which we 
therefore denote by Um. Taking X = J2 in (32) and (33), we get 

Ui = Rui, U = Rut+i “h y>t (i = 1, * . . , r *“ 1). 

Writings for ^ + 1, we obtain the first line of (30). By their origin, 
the Ui are linear functions of Xi, ... ,Xa whose coefficients are 
polynomials in R with coefficients in F. 

It remains only to prove that the new variables are linearly 
independent. If 

^ J 

(34 ) ^ ^ ff 7 ^7 “h hj V j *4“ Jo j w j = 0, 

7«1 

identically in xi, ... ,Xay we are to prove that the constants 
Qj X, Ic are all zero. From the corresponding identity in the U, V, 
TV, we insert the values (30), subtract the product of (34) by J2, 
and get 

r s s 

^ ^ Q 7 ^ 7—i " 4 “ ^ 7 (^ R^^ 7 hjV 

j #«2 7 *1 7 

(35) 

t t 

+ -B)^/ + = 0. 

2-1 2—2 

If we can prove that the coefficients of Ui, Vi, Wi in (35) are all 
zero, we shall have = 0 by Wt, then kt^i = 0 by Wt-i , . - . , 
Ai = 0 by Wij and similarly every ft* = 0 by Vi and every 0 
(j > 1), whence (34) becomes giUi = 0. But the coefficient of 
Xa iri ui = f{R) is Co(i2) == 1. Hence also gi = 0, so that all 
coefficients in (34) would be zero. 

Since (35) is of the form (34), but lacks Un it suffices to prove 
that a relation of type (34) which lacks Ur has every coefficient 
zero. The new identity (35) now lacks both Ur and Ur^i. Taking 
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it in place of (34), we see by repetitions of this argument that the 
problem reduces to one involving only the Vi and In the 
corresponding identity in V^ and Wt, we insert the values (30), and 
subtract the product of the former identity by S. Then as before 
we eventually obtain a relation involving only the Wi. This time 
we subtract" the product by T, 

54. Elementary divisors. In the classic canonical form (30) of 
(29), we saw that the characteristic determinants of the partial 
transformations, given by the successive lines of (30), are, apart 
from sign, 

(36) (\-Ry, (X~.T)S..., 

whose product, apart from sign, is the characteristic determinant 
A(X) of (29), i.e., its unique invariant factor distinct from 1. 
These divisors (36) are called the elementary divisors of A(X) or 
of the X-matrix of (29). Recalling the definition (19) of the 
invariant factors of the X-matrix of any canonical transformation, 
of which our (29) is only one partial transformation, we obtain 

Theorem 14. The elementary divisors of the \-matrix of any 
linear transformation are obtained from its invariant factors other 
than 1 by taking the highest 'power of each linear factor of each such 
invariant factor which is a divisor of the latter. Conversely, from a 
list of the elementary divisors written in any order, we may determine 
uniquely the invariant factors I, other than 1 by usi'ng the fact that I ^ 
divides I 

For example, let the invariant factors be 

J, = 1, Jo=X, J3=X(X~1)2, l4r=X2(X-l)3. 

Then the elementary divisors are 


xs X, X, (X - ly, (X - 1)^ 
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Conversely, given the latter, we deduce I 4 as the product of the 
highest powers of the distinct linear functions, then J 3 as the 
product of the next highest powers, etc. 

The classic canonical form shows that there exists a linear 
transformation on n variables whose X-matrix has any preassigned 
elementary divisors the sum of whose degrees is n. 

Our definition of elementary divisors applies only to X-matrices 
If X/. In §52 we defined the invariant factors of P = pM 
+ criV, where M and N are any n-rowed square matrices. All the 
1 3 distinct from 1 may be expressed as products of powers of the 
same linear functions of p and <7, no two of which have propor- 
tional coefficients. Then those powers which have exponents > 0 
are called the elementary divisors of P. 

Exercises 

1. The three types of canonical linear transformations on three variables 
are 

U = Ru, V — Svy W ^ Tw; 

TJi = Ruij TJi = Rti2 y'lj V == Svj 

TJi = Rui^ TJ% = Ruz -i“ ~ Ruz 4" "^ 2 - 

2. Find the elementary divisors of the X-matrix of each transformation in 
Ex. 1. 

3. Verify that formula (28) for any rotation in space is equivalent to the 
equation p = qwq' in real quaternions 

5 = dJ “h ai i + (i2j Qzhj w — vhj + Wz Jc, 

p ^ Vii + Vzj + PzJCf 

where q is of norm unity. See Ex. 6. 

4. The equation ^ = xy in matrices 



defines a homogeneous linear transformation on xi, . . . , Xi which evidently 
leaves invariant the quadric surface S defined by ] a; | == 0; verify that it 
leaves invariant each of the lines 
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Lk'. rri = kxzj X2 = kxA 
on S. Similarly, ^ — yx leaves invariant each line 

Zfc • 2/1 JiiXif X^ *” JcXij 

also on S. Then^ ^ = zxy leaves S invariant, and permutes the lines Lk 
amongst themselves and likewise the lk- 

5. In Ex. 4 introduce new variables Xr and new parameters Yr such that 
S becomes ZXr^ = 0 : 


Xi == Xi + iXij X 4 = Xi — iX 4 f X 2 = — X 2 “h iXsj X 3 = X 2 4" 

2/1 = E4 — iYij 2/4 = F4 + iYi, 2/2 = Es 4 “ iY2f 2/3 = Es 4 - ^Eg. 

Then to ^ - xy corresponds H = XE in quaternions X — Xi^ + Xaj 
+ X 3 4“ X 4 /. Thus if Z and E are real quaternions, S = ZXY gives every 
real linear transformation of positive determinant which leaves SXr® 
invariant up to a constant factor.^ 

6 . Deduce from Ex. 5 the real orthogonal transformations on three vari- 
ables. Take X 4 = 0, Si = 0. In other words, take X4-X'==0, S4-S' = 0. 
Hence shall EXE = EXE'. Write p for Z' E-^ Then p' X = Xp. The 
case X = i shows that the coefficient of “i in p is zero. By symmetry^ the cases 
X — j and X — k show that the coefficients of j and A; in p are zero. Hence 
p = rJ, where r is a real number. Thus Z' — rY and S = ZXZ' 
Taking norms, we see that the condition that Xi^ 4- Xi 4- Xs^ be invariant 
is n(Z) = ± r, which is therefore positive. Write q for the real quaternion 
Z( ± r)“l of norm unity. Thus S == ± qXq\ The sign is in fact 4- or~, 
according as the determinant of the orthogonal transformation is 4” 1 or — 1* 
This gives a new proof of Ex. 3. 

^ These transformations and their products by {xi xzi . which interchanges the 
two sets of lines, are the only linear transformations leaving S invariant. Dickson, 
BuH. Amer. Math. Soc., 22, 1915, 53-61. 

®For an elegant proof using only quaternions, see Hurwitz, Zahlentheorie der 
Quatemionen, Berhn, 1919, 62-66. 
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PAIKS OF BILINEAE, QUADRATIC, AND HERMITIAN 

FORMS 

55. Historical note. For a pair of bilinear or quadratic forms 
(the second being non-singular), the theory was first obtained by 
Weierstrass^ by developments in series, which introduce certain 
square roots. There was a serious gap in his proof. For pairs of 
bilinear forms, all irrationalities are avoided by the method of 
Frobenius.^ However, the resulting rational criterion for the 
equivalence of pairs of bilinear forms m any field may be proved® 
by simple modifications of Weierstrass’s method. We shall give 
here a still simpler rational method based on the theory of linear 
transformations. 

56. Equivalence of two pairs of matrices. Two pairs oi nXn 
matrices M, N and Uj 7, with elements in any field F, are called 
equivalent in F ]i and only if there exist non-singular n-rowed 
matrices P and Q with elements in F such that 

(1) PMQ - U, PNQ = 7. 

We shal first treat the case in which N and 7 are non-singular 
(cf. §57). Suppose that two such pairs are equivalent. Then 
= J is similar to (77”^ = W since 


P/P-i = PMQ • AT-i P-i = C77-^ = T7, 


^Berlin. Berichte, 1868, 310; Werke, II, 19. 

® Jour* fiix Math., 86, 1879, 146-208 (see p. 202). For a report on this and related 
pai)ers, see the author’s History of the Theory of Numbers, III, 1923, 284r-8. 

® Hickson, Trans. Amer. Math. Soc., 10, 1909, 347-60. With the same modifica- 
tions, the method was extended to Hermitian forms by M. I. Logsdon, Amer* 
Jour. Math., 44, 1922, 254-60. 
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so that the X-matrices of J and W have the same invariant factors. 
By §52, the X-matrix J — XJ of J has the same invariant factors 
as 

(J - \I)N = {MN-^ - }J)N - If - XiV. 

Likewise, the X-matrix If — XT of TV has the same invariant 
factors as U — XF. Hence M — XiV and U — W have the same 
invariant factors. 

Conversely, the last result implies that the X-matrices of J and 
If have the same invariant factors, whence J and If are similar in 
F, In other words, there exists a non-singular matrix PmF such 
that PJP-^ = W, Then 

P(J - Xl)P“i = F - XI, 


- XJ)P-i = 177-^ - XJ, 


P{M - \N)N-^P-^ = (U - X7)F“L 
Writing Q for AT*"^ P*"^ F, we get 

PiM-}J^)Q^ U-XF. 

By the terms free of X and those containing X, we get (1). 

Theorem 1. Two pairs of n Xn matrices M, N and 17, F, with 
elements in any field F, such that N and V are non-singular, are 
equivalent in F if and only if M — \N and U — XF have the same 
invariant factors, or, if we prefer, the same elementary divisors. 

We evidently have the same criterion for the equivalence in F 
of two pairs of bilinear forms with the matrices M, N and 17, F. 

57. Next, we omit the assumption that the determinant ] AT [ of 
N is not zero, but assume that ] M + xN ] is not zero identically in 
X . Thus we can select a number v 9 ^ 0 of F such that B ^ M 



114 


PAIRS OF FORMS 


fCh.VI 


+ vN is not singular. Then if the pairs M, N and U, V are equiva- 
lent in F, also the pairs M, B and U^C = TJ + vV are equivalent 
in Fj and C is not singular, and conversely- Hence by Theorem 1, 
the pairs ikf , N and C7, Y are equivalent in F if and only if Af — XB 
and U -- \C have the same invariant factors or the same ele- 
mentary divisors. The latter property holds if and only if the 
matrices pM + <rN and pU + crV have the same elementary 
divisors, as will be proved by means of the following 

Lemma. If A — pM + qN, B = uM + vN^ where pv — qu ^ 0, 
so that pA + tB = pM + (tN for 

(2) pp + UT, cr = qp + VT, 

then the elementary divisors of pA rB are derived from those of 
pM + aN by the transformation (2). 

Let mp + be a factor of aU j-rowed determinants of pM 
+ dN and let e be the exponent of the highest power of wp + 7 i<r 
which divides all those determinants. The transformation (2) 
replaces any element of pM + aN by the corresponding element 
of pA + tBj and hence replaces any J-rowed determinant of 
pM + dN by one of pA + rB. Hence if (2) replaces mp + Ud by 
ap 4- br, {ap + 6t)® is a factor of all ^-rowed determinants of 
pA + tB, It is, moreover, the highest power of a/x + &r which 
divides those determinants. For if they were all divisible by 
{ap + 6t)^, where g > we see by appljdng the inverse of trans- 
formation (2) that (mp + would be a factor of all j-rowed 
determinants of pM + dN, contrary to hypothesis. The Lemma 
now follows from the definition of elementary divisors in §54. 

We may employ the Lemma when p — u = 1, q = 0, v 0, 
Then A = Af , B = M +- vN> Hence the elementary divisors of 
pM + tB are derived from those of pAf + dN by the transforma- 
tion p — p + r, d = vr. Thus, if (7 = (7 + vY^ then pM + tB 
and pU + tC have the same elementary divisors if and only if 
pAf + dN and pC/ ■+• dV have the same elementary divisors. 
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Taking ^ = 1, r = — X, we have the notations employed in the 
discussion preceding the Lemma. This proves 

Theorem 2. Let M, N, U, V be any n-rowed square matrices 
with elements in any field F such that neither of the determinants 
I pM + (tN I and \pTJ (tV\ is zero identically in p and a. Then the 
pair M, N is equivalent in F to the pair V if and only if pM + (tN 
and pU + aV have the same invariant factors {or the same elementary 
divisors). 

For brevity we shall speak of the invariant factors of pM + aN 
as the invariant factors of the pair of bilinear forms having the 
matrices M and iV, and speak of the case in which | pM + cN j is 
not zero identically in p and c as the non-singular case. Then 
Theorem 2 may be stated in the following briefer form: 

Theorem 3. In the non-singular cascj two pairs of bilinear forms 
in Xij ... , Xnj yi, • • • ^yn with coejficients in any field F are equiva- 
lent under non-singular transformations in F if and only if they have 
the same invariant factors. 

58. Canonical forms of a pair of bilinear forms. Let their ma- 
trices be M and iV, where N is assumed for the present to be non- 
singular. The pair Af, N is equivalent to the pair J = MN~~^y 
= J, If p and Q are non-singular and such that PJQ = Wy 
PIQ = I, then Q = P”^. Hence the pair J, I is equivalent to the 
pair IF, I if and only if PJP-^ = W, i.e., if J and W are similar 
matrices. 

When W is the matrix of the canonical form of the linear trans- 
formation of matrix J = MN~^, we shall call W, I the canonical 
pair of Af , N. 

When all the elements of Af and N belong to a field F, we may 
choose W to be the matrix of the canonical form with coefficients 
in F of Theorem 1 of Ch. V. This proves 

Theorem 4. Any pair of bilinear forms in Xi , ^ Xn , Vh * ^Vn 
with coefficients in any field F, such that the second form is non- 
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singular, can he reduced by a non-singular linear transformation on 
the x’s and one on the y’s, each with coefficients in F, to a unique 
canonical -pair 


^ g — 1 a V y . 5— 1 & ^ 

/ = (2 XiPi+i+Xa'^ at Zi Wi+iH-Ss^S , 

' £*=1 £»=1 

a h 

g = J^Xiyi + '^ZiV}i-\r • - ‘t 

£»1 i=l 

where a + 6 + • • • = n. If there are k of the numbers ^ ,the 

invariant factors of f — \g are • • • ? ^7 In’^k+ly • • • > 

h a 

In-l — yf hi /„ = X“ — 2 Ci 

£=1 £==“1 

Since the at, 6t, . . . are any numbers of F such that J,* divides 
I f+i for every j < n, we have the 

Corollary. There exists a pair of bilinear forms f, g inn + n 
variables with coefficients in F such that / — Xg has any preas- 
signed invariant factors J,* with coefficients in F such that If 
divides I/+i. 

Next, if we choose W to be the matrix of the classic canonical 
form of Theorem 13 of Ch. V, we obtain 

Theorem 5. Within the field of all complex numbers, any pair of 
bilinear forms in n + n variables, of which the second form is nonr 
singular, can he reduced to the classic canonical pair 
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where r + s + ••• = «. The elementary divisors of 4> — Xf' are 
(X-S)*,... 

CoROLLABY. There exists a pair of bilinear forms <j> and ^ in 
n + n variables, of which \j/ is non-singular, such that (f> — X\p has 
any preassigned elementary divisors the sum of whose degrees 
is n. 

We now omit the assumption that \n \ 9 ^ Oj but assume that 
I pM + aN I is not zero identically in p, a. Choose u and z; in F so 
that B = uM + vN is non-singular. Then choose p and q in F 
so that pv — qu = Ij and write A == pM + qN, Then 

(3) M ^vA-qB, W = - wl + pB. 

The pair of bilinear forms with the matrices A and B can be 
reduced rationally to a pair / and g of Theorem 4. The same trans- 
formations reduce the pair with the matrices M and N to the pair 
c = vf — qg and d = — uf + pg. We have pc + ord — pf rg 
for 

(4) ii-=- vp - Ud, T = -qp + p<r, 

which give the solved form of (2). The invariant factors of 
Mf + '^g reduce to those oi f — \g by writing m = 1; = ““ X- 

Conversely, if we make the invariant factors of / — Xg homogene- 
ous in X, Pj and then replace X by — r, we get the invariant fac- 
tors of pf + rg. The transformation (4) replaces the latter by 
those of pc + (xd. This proves 

Theorem 6. In the norirdngidar case, any pair of bilinear forms 
in the variables a;i, . . . , Xn^ yi, , . . , with coefficients in any field 
F, can be reduced by a non-dngular linear transformation on the x^s 
and one on the y^Sj each with coeffiwients in F, to a canonical pair 
c = vf — qg and d = — uf + pg, where v, g, % p are numbers in F 
such that pv qu = 1, while f and g are the forms in Theorem 4. 
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The invariant factors of pc + ad are obtained by applying Ivans'- 
formation (4) to those of pf + rg, viz,, 

1, . , . , 1 , In—k+l} • • * y 

I«-l = T® — ^ 6,+l T»(— 

j=0 

a— 1 

I „ = — 23 0/+1 ■>■’ (— 

3 =a=0 

If (X — jB)’‘isan elementary divisor of A — \B, then (r 
+ pR)^ is one of pA + rB, and conversely. By the Lemma in 
§57, the inverse (4) of transformation (2) replaces an elementary 
divisor (r + pR)'‘ of pA + rB by an elementary divisor (mi p 
+ nicr)^ of pM + (tN, Since only the ratio of mi and ni is material, 
we may take mi w + ni a; = 1, Then if = mi p + we have 
mi p + ^1 0 - = r + pR. By Theorem 5 we can reduce the pair of 
bilinear forms with the matrices A and B to the canonical pair 
<l> and 4^. Then that with the matrices (3) is reduced to the pair 
a ^ — q-p and = -- u<j> + VPy whose expressions reduce to 

those given in Theorem 7. For, pv — qu = 1, mi w + ri-i a; — 1, 
mip + niQ — R imply mi vR — q, rii = p — uR; and, when 
we employ S instead of R, we replace the subscripts 1 by 2 and 
hence have m 2 = vS — q, rh — p — uS, 

Theorem 7. In the non-singular case, any pair of bilinear forms 
inn + n variables can be reduced to the classic canonical pair 

cc = (miX 

"b rji Pi + + • • • » 
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i=»l i=»2 ^ 

f ^ ^ Vi Vi ^ 'Q^y »— ly H~ • • • • 

The elementary divisors of pa + cr^ are 

(mi p + nicr)^j (m P + ri2 (t)% 

CoKOLLART. There exists a pair of bilinear forms a and ^ in 
n + n variables such that pa + <r^ has a determinant which is not 
zero identically in p, o-, and has any preassigned elementary 
divisors the sum of whose degrees is n. 

Exercises 

1. Two q\iadratic forms with the matrices A and B have as invariants of 
index 2 the coefficients of the various powers of X in the determinant of 
A -\B. 

2. By means of Theorem 5, write down the three types of canonical pairs 
of bilinear forms when n = 3. 

59. Pencils of bilinear forms. Instead of a pair of bilinear forms /x 
and Vj with coefficients in F, consider the pencil [y, v] composed 
of aU bilinear forms pp + av, where p and cr range over F. This 
pencil coincides with the pencil [a, /3], where 

(5) a = pp + qvf P — up + vv, pv — qu 9^ 0. 

The pencil is called non-singular if the determinant of pp -|- (rv 
is not zero identically in p, cr, 

A second such pencil, [pi, ^i], is equivalent in F to the pencil 
[py v] if and only if the pair pu n is equivalent to some pair (5) 
of the pencil [p, v]. 

In view of the Lemma and Theorem 2, we have 
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Theoeem 8. Let Nj Mi, Ni be the matrices of the bilinear 
forms jXj V, /xi, vi with coefficients in any field F, The non^singular 
pencils [/i, j'] and [mij ^'i] equivalent in F if and only if all the 
invariant factors (or elementary divisors) of pi Mi + ori Ni can be 
derived from those of pM + aN by the same non~singular linear 
transformation expressing p and a linearly in terms of pi and <ri with 
coefficients in F, 


60. Tke nth roots of a matrix. We shall need the 
Lemma. If a,b,Cj . . . are distinct and not zero^ and 

4/{x) = {x -- a)^ {x — b)^ {x — c)K . . , 

there exists a polynomial x(^) that [x(^)]’‘ ^ is exactly 

divisible by \l/{x). 

For example, if n = 2 and 1/ ^ {x — a)^, then 

X = + a), - X ^ {x - a^. 


To prove the Lemma, consider the expansion (by the binomial 
theorem) of in a series of ascending powers of x — a: 


^ X ^ a ^ 1 + (x — a) /a 



g; — g 
na 


+ 


(1 — n) {x — a)2 
2n^ aP- 



and write F{x) for the sum of the first r terms, so that 
— F(x) is termwise divisible by {x — ay. Similarly, let G{x) be 

the sum of the first s terms of the expansion of in a series of 

powers of a; — 6, so that G{x) is divisible by {x — 6)®; 

etc. 
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By decomposition into partial fractions (or by expansion in a 
series of ascending powers of a; — a), let 


Fix) 

4'{x) 


Ao 

[x — a)’* 


+ ••• + 


A r—l 
X — a 


+ R{x) 


A{x) jx - a)^P(x)^ 
(x-a)^ i^ix) 


where the Ai are constants and R{x) is the quotient of a poly- 
nomial P{x) by i^(x)/ix — «)»*. Multiplication by 4^{x) shows 
that the j&rst r terms in the expansion of \f'{x)A{x)/ (x — a)’* 
in a series of ascending powers of x — a have the sum F{x) and 

hence are the same as the first r terms in the expansion of 

In other words, the difference between this fraction and js 
divisible by {x — a) 

Determine B(x) from G(x) just as we determined A(x) from 
F (x). Then the difference between and the second term of 


^ }p(x)A(x) ^(x)g(x) 

(x — a)*" (x — b)* 


is divisible by (x — 6)*. The first, third, . . . terms are divisible 

by {x — b)‘ since ^(a;) is. Hence x — ® is divisible by 
(x — 6)* and similarly by (x — a) , and hence by their 
product ^(x). 

Theorem 9. If X is a non-singvlar square matrix, ffiere exists 
a 'polynomial x(^) that the nth povxr of the matrix x{X) 
is X. 

If <#>(X) is the characteristic determinant of X, then <p(X) = 0 
by §25. Since X is non-singular, the constant term of ^> (\) is not 
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zero. Hence we may employ (X) as the (X) of the Lemma and 
conclude the existence of polynomials xW ^^at 

IxMY - X s .^>(X)9(X), 

identically in X. Since the coefficients of like powers of X in each 
member are equal, and since X’ = X*+', we conclude that 

[x(X)]" - = 4>(X)q{X) = 0. 

61. Equivalence of pairs of qtmdratic or Hermitian forms, or 
symmetric or Hermitian bilinear forms. We saw in §56 that the 
theory of equivalence of pairs of bilinear forms may be developed 
rationally in any field. But this is not true of pairs of quadratic 
forms, in the investigation of which we shall employ Theorem 9 
and hence introduce irrationalities. Whatever method be em- 
ployed, irrationalities caimot always be avoided, as is clear from 
the following example. The real quadratic forms — 2y^ and 

— 3y^ are transformed into 2* '-1- and ^ -|- 3v? respectively 
by the transformation x = z,y — {— l)^w, but by no real trans- 
formation, since the first forms have real linear factors while the 
second forms do not. We shall therefore study equivalence with 
respect to the field of all complex numbers. 

Consider four bilinear forms a, /3, y, d whose matrices A, B, 
C, D all have the property X' = X. By Theorem 7 of Ch. Ill, 
the pair a, ^ is equivalent to the pair y, d xmder a non-singular 
transformation on the a:’s with matrix E and the transformation 
on the y’s with matrix E if and only if 

(6) C = E'AE, D = E'BE, \E\^ 0. 

According as X denotes X or X, onr forms are symmetric or 
Hernntian bilinear forms. If we identify with for j = 1, . . , 

we conclude that two pairs of quadratic or Hermitian forms are 
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equivalent if and only if their matrices satisfy (6), where E = E 
or E in the respective cases. 

Theohem 10. If the four n-rowed square matrices A, B, C, D 
have the property X' = X, and if there exist non-singular n-rowed 
matrices P and Q such that 

(7) C = PAQ, D = PBQ, 

then there exists a non-singular nrrowed matrix E for which equations 
(6) hold. 

By hypothesis, 

(8) C = C' = Q' A' P' Q' AP'. 

Equating this to C = PAQ, we get 

(O')-' PA = AP’ Q-K 

Write 

(9) Z=(0')-'P, 

so that X is non-singular. Then 

X' = P' Q~\ 

whence 

XA = AX', X^A = XAl' = A0C'y. 

By induction on k, we get 

(10) X’‘A=AiX')K 

Let x(^) = S be any polynomial in x. Multiply A = A 
by Co, and (10) by c* for ft = 1, 2, ... , and add; we get 

(11) x(^)A =Ax(l'). 
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By Theorem 9, we may choose complex numbers Ck so that the 
square of the matrix F = x(-^) is equal to X. Since X is non- 
singular and = Xj also V is non-singular. Then (11) gives 

A = F-i Af'. 


Thus (7i) becomes 

C = PV-^Af'Q. 

Write E =V'Q. From = X and (9), we get 0' F = PV^K 
Hence C E' ASJ, This proves the first equation (6). 

Since X and F and hence also E depend only on P and Q, but 
not on A or C, the second equation (6) follows similarly from the 
second equation (7). 

Theorems 10 and 3 evidently imply 

Theorem 11. In the nonrsingular case, two pairs of symmetric 
(or Hermitian) bilinear forms are equivalent under a nonrsingular 
linear transformation on the x^s and the cogredient (or conjugate) 
transformation on the y^s if and only if they have the same invariant 
factors. 

From this and Corollaries 1 and 2 of §§33, 34, we have 

Theorem 12. In the nonrsingvlar case, two pairs of quadratic or 
Hermitian forms are equivalent under norinsingular transformation if 
and only if they have the same invariant factors. 

Consider the case in which the second quadratic or Hermitian 
form of each pair is ^ Xi Xi, whose matrix is the identity matrix I. 
Hence Theorem 12 implies the 

Corollahy. Two quadratic (or Hermitian) forms with the 
matrices A and C are equivalent under an orthogonal (or unitary) 
transformation if and only if the X-matrices A — X/ and C — Xf 
have the same invariant factors. 
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The conditions that two pairs of real quadratic forms shall be 
equivalent under real transformation are complicated.^ 

Exercises 

1. Every binary non-smgxxlar quadratic form is equivalent under an 
ortbogonal transformation (not necessarily real) to one of the following: 

ax^ + (a 7^ 6 ) ; ax^ 4- 2ax^ + 2aixyf 

whose invariant factors are (a — X) (5 ~ X); X — n, X — a; (a — X)®. 

2. In the non-singular case, two quadratic forms / and h can be reduced to 
forms involving only squares of the variables if and only if their elementary 
divisors are all of the first degree. Hence the latter is true by §37 when / is a 
real quadratic form and h is positive and non-singular. State the correspond- 
ing theorems for Hermitian forms. 

62 . Pairs involving alternate forms. A bilinear form with the 
matrix A is called alternate if A' = — A, and alternate Hermitian 
if A' - - I (cf. Exs. 4, 5 of §38). 

Theorem 13. Theorem 10 holds also when 

A' = - I, c = -G, B' = ±B, D' = ±D. 

For, (7i) then implies 

C = - C?' = - Q'A'P' = + Q'AP', 

so that (8) holds also here. The rest of the proof of Theorem 10 
applies here unchanged. 

Theorems 13 and 3 imply 

Theorem 14. In the non-singular case, two pairs of forms a, p 
and y, d, of which a and y are alternate bilinear forms (or alternate 
Hermitian forms), while p and d are either alternate or are symmetric 
(or Hermitian) bilinear forms, are equivalent under a non-singular 
transformation on the x^s and the cogredient (or conjugate) transfor- 
mation on the y^s if and only if they have the same invariant factors. 

^Muth, Jour, fiir Math., 128, 1905, 302-21. For pairs of binary quadratic forma 
in any field, see Dickson, Amer. Jour. Math., 31, 1909, 103-8. 
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If a is a symmetric and jS an alternate bilinear form, any ele- 
mentary divisor of pa + <t/ 3 of one of the special types or 
is one of an even number of equal elementary divisors."^ 

63. Pairs of forms with preassigned mvariant factors. 

Theorem 15. There exists a pair of quadratic (or Hermitian) 
forms <j) and ^ with coefficients in the field F such that 4^ is non^ 
singular and the matrix of <1) — \\l/ has any preassigned invariant 
factors Is with coefficients in F (or in the real suhfield S of F), where 
Is divides Is^ifor every s. 

If there are t invariant factors Is distinct from unity, we shall 
prove that we may take <?^> = 23 23 where as — 

has the single invariant factor Is distinct from unity, and where no 
variable of as or fis occurs also in ai or pi for i 9 ^ s (cf. the proof 
in §43). In the further discussion we shall omit the subscript s. 

For an odd number 2n — 1 of variables, we may take 

^ n 2n-2 

^ 2 V an X% Xx HH 2 j Xx^x ^2n—'i 

ta=i ^=*1^ 

n-1 

(12) ^ +E (a i i+1 Xi Xt-fl + ^i+1 % ^1+1 Xi)j 

l==l 

2n-l 

P 23 ^2n— t; a^i == Uij, 

We shall prove that the determinant An of a — \p has the value 

A„ = (- 1 )»-![- +f)aii X2*-2 

l='l 

« - 

+ 23 (®i-l i + ax • 

1-2 

^Kronecker, Monatsber. Akad. Berlin, 1874, 397; Werke, I, 423. Proof by Fro- 
bemus in Encyclop6die des Sc, Math., 1. 1, voL II, 463-9; and in Muth’s Elementar- 
theiler, 1899, 135-142, 231-2. Stickelberger, Jour, fiir Math., 86, 1879, 42-43. 
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We have 



du 

0 

0 

(hi 

022 

(123 

0 

0 

(I 32 

a33 

dSi 


0 0 - X 

0 - X 1 

- X 1 0 


0-X10--0 0 0 

~X 100---0 0 0 


in which every element of the secondary diagonal is — X, except 
the middle element ann — X, while every element just below that 
diagonal is 1, and the further elements below it are zero. Hence the 
minor of an is + 1. Multiply the first row by 1/X and add to the 
second row; then multiply the first column by 1/X and add to the 
second column. Now — X is the only element 0 in the last row 
and last column. Hence ~ X^ d, where d is the determinant 
obtained from by deleting the first and last rows and columns 
and replacing a 22 by 


&22 = CL22 + (O'n + U2l)/X + C^ll/X^. 

We see that d is derived from An_i by replacing every a^j by 
Ui+i, 3+1 except an, which is replaced by 622 . In a proof of the for- 
mula for An by induction on n, we assume the formula when n is 
replaced by n — 1. In the latter we make the preceding replace- 
ments and hence get d; its product by — X^ is seen to reduce to 
the expression for An. Finally, Ai = an — X is the determinant of 
(an — \)xi xi, to which a — XjS reduces when n = 1. Hence the 
induction is complete. 

Hence the matrix a ~ XjS has the invariant factors 1, . . . , 1, 
±An. By choice of the a^/, ±An can be identified with any given 
polynomial in X of degree 2n ~ 1 whose leading coejEcient is 1 and 
the remaining coefficients g have the property g = g- 
For an even number 2n of variables we employ 
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'' n 

71—1 

“j“ (fl^t i+l S'i ^%JrX “f“ * ^t+1 ^0 

i.“l 

(13) ^ 

2n-l 

+ S ^2n41-£ ^2jt “f* 

i-l 

2n 

/3 = S ^2n+l-i. 

i-1 

For w = 1, the determinant of a — X/3 is 

(14) - ^ 1 ^ = — [X* — (s' + ^)X + firg — ail]. 

For n > 1, the determinant A of a — X/3 is 

Ou aia 0---0 0 0 0 0 0---0 0 g-\ 

ci 22 ^22 ***0 0 0 0 0 0*‘*0 — \1 

0 032 O33 ••"0 0 0 0 0 0***“~-Xl 0 


0 0 0 * * * On^ln — 2 0>n—ln—l On— in 0 Xl***0 0 0 

0 0 0 •••0 Onn-l Onn -X 1 0 •••0 0 0 

0 00**-0 0 -X 1 00---000 


0 - xl *-*0 0 0 0 0 0---0 0 0 

g-\ 10“-0 0 0 0 0 0---0 0 0 

Multiply the (n + l)th row by X and add to the nth row; then 
multiply the (n + l)th column by X and add to the nth column; 
we get a like determinant having a„n replaced by bnn = an.n — X^, 
and having the terms — X in the nth and (n + l)th rows replaced 
by 0. K n = 2, we have (16) with 622 = 022 — X^. 
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If n > 2, multiply the nth row by X and add to the (n — l)th 
row; then multiply the nth column by X and add to the (n — 1 )th 
column; we get a hke determinant having an^i n--i replaced by 

(15) = 0>n-l n-l + X(aft-_i « + Un n-l) + X® h^n, 

having X5nn added to both an »-i and Un-i n (which does not effect 
our final result), and having the terms — X in the (n — l)th 
and (n + 2)th rows replaced by 0. Thus A is the negative of the 
determinant obtained by deleting the rows and columns numbered 
n, n + 1, and n + 2. However, we shall continue to number the 
rows as in A. If n = 3, we have (16), where 622 is given by (15). 

If n > 3, multiply the (n — l)th row by X and add to the 
{n — 2)th row; multiply the (n — l)th column by X and add to 
the {n — 2)th column; we get a like determinant having an ^2 n~2 
replaced by 

^n— 2 n— 2 “ Ctn— 2 n— 2 "t” X (cin—l n— 2 “4" ^n—2 n— l) “f" X^ 

having X6n-i n-i added to both a^-i and n-i, and having 
the terms — X in the (n — 2)th and {n + 3)th rows replaced by 
0. Thus A is equal to the determinant obtained by deleting the 
rows and columns numbered n — 1, ti, n + 1, 2, n + 3. 

Proceeding similarly, we see that 

Cti2 Q — X 

(16) A = (— 1)’^“2. 5^2 1 

1 0 

Hence 

^ == ( — l)»^[— ail + Ui2 + (hi Q “* (^12 + c^2i)X 

622 (g - x)(g “ X)], 

where 622 may be computed by means of the recursion formula 

bii = an + X(ai t+i + <Xi+i i) + X^ bi+i (i = 2, . . . , n — 1), 
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and bnn = a„„ — XI Thus 

n n— 1 

622 = — X-”-- X“^~^ +X/ "t" ^^4-1 ijX^*-^. 

i=2 1=2 

Hence we obtain the explicit formula 

(17) (- 1)- A = r- - (sr -b g)\^n-i + (^g ^ 

+ [— an~l % — Cln n-l + (^ + ^)^»n]X“"“® 
n~l 

— ““ (^ + ^) xi-i + aj+i i) 

a=2 

n-l 

~ X) [a._i i + a^ x_i + gg (a£ £+i + a^^l ^) - a^i{g + g)]\^^-^ 
i =3 

+ [— ai2 — <221 + (g + g)a22 “ gg (023 + <232)]X 

— ail + <^12 g + <221 g — gg 022, 

where; by (14); only the first three terms are retained if n = 1, 
while if n = 2 we retain the first three and last two terms, and 
omit the last part of the coefficient of X. 

We can identify (17) with any pol 3 momial X^*^ + •••with 
coefficients in F or S, according as x denotes x or x. This may be 
done by choice in turn of the letters written below the proper 
powers of X: 

X2n-l^ ^2n-2^ ^ 2 n- 3 ^ _ ^ 5^23-2^ ^ ^ ^ 

g §} Otnn? ^n-1 n (^n «— 1 ; • • • j j-i-l Ofj+l f, — G-jf, . . . 

where j takes values decreasing to 1 inclusive. We have here 
omitted from the complete coefficient of any the terms involving 
only letters appearing under . . , , in the table. 
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64. Weierstrass’s canonical pair of quadratic forms. Whether 
the number of variables is odd or even, we have exhibited a pair 
a, of forms (12) or (13) whose invariant factors are 1, . . . , 1, 1, 
where 1 is any preassigned pol3naomiaL Omitting the signs - over 
Xy we have the case of quadratic forms. Let 

k 

I = II (X --- X,)-;. 


If each Cj and each is not zero, the pair 

f k ^ ] e j — 1 

T ~ ^X, C j Xtj Xe j — d j X^j Xg 


( 18 ) i 






V. j«=l 


has the invariant factors 1, . . . , 1, J. For, the matrix of 7 — Xd 
has the abbreviated notation 


Ml 0 0 - • • 0 0 
0 M 2 0 • • • 0 0 


[0 0 0 ••• 0 Mfc 


where 0 is a zero matrix, and Mj is the Cf rowed square matrix of 
those terms of 7 — Xd in which j has a fixed value: 




0 0 0 • ‘ * 0 dj c, (X,- - X) 

0 0 0 C/(X, — X) 0 

d, c, (X,- - X) 0 ‘ 0 0 0 

c, (X, - X) 0 0 ••• 0 0 0 


The elements of the secondary diagonal are all Cj(Xj — X), those 
just to the left of them are all d/, while all further elements are 
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zero. Apart from constant factors, tiie determinant of If / is 
(X — Xf )®3 and the minor of the last element of the last row is 
(I jCj-i =: constant 0. As in the first part of §53, the invariant 
factors of 7 — Xd are 1, . . . , 1, 1. Hence the pair 7 ? d is equivalent 
to the pair a, |3. 

In §63, we obtained a pair of quadratic forms <j> 

Ip == having exactly t prescribed invariant factors 5 *^ 1 , 
where a® — has the invariant factors 1, . . . , 1, !«. In place of 
we have now found that we may take a pair 7 *, d^ of type 
(18). Hence X^d^ have the t prescribed invariant factors 

9 ^ 1. Changing the meaning of h, we may now suppose that 
(X — Xj)*/, for j — 1, . . . , fc, give all the highest powers of linear 
functions which occur among all the invariant factors (instead of 
those for a single one I as before). Hence we obtain 

Theoeem 16. There exists a pair (18) of quadratic forms having 
any prescribed elementary divisors 


(X - Xi)% . . . , (X - 


For example, let the number of variables be three. 

(i) 61 = 62 = cs = 1 . Take Ci = 02 = 1, cs = ~* 1 , and write 
aJ/forxij. Thus 

T = Xi ^1^ + X2 ^2^ - Xs Xz\ 6 = xi^ + xi — xi. 

If the X’s are distinct, the conics 7 = 0 and d = 0 have four 
distinct points of intersection given by 

x^ ixi \xi X2 — X3 : Xs — Xi : X2 — Xi- 

If Xi 7=^ X2 = X3, the only intersections have == 0, == ± X3, 

and the conics are tangent at two points. If Xi = X2 = X3 7*^ 0 , 
the conics coincide. 

(ii) Cl = 2 , 62 = 1. Take ci = = di = 1 , and write Xi, X2, Xs 

for Xu, X21, X12. Then 
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7 = 2Xi Xi X2 + Xi^ + X 2 Xs^, d = 2x1X2 + xi. 

If Xi = X 2 , the only intersection has xi = Xz = 0, and the conics 
have contact of the third order. But if Xi 7*^ X 2 , there are two 
fiirther points of intersection having 


Xi = 2(X2 — Xi)a:2, Xz^ = — 4(X2 Xi)x2^, 


and the conics have simple contact at Xi = Xz = 0. 

(iii) ei = 3. Take Ci = di = 1, and omit the second subscripts 
on the a;'s. Then 

7 = Xi(2a:i Xz + X2^) + 2 xi X2, d = 2 xi Xz + 

The conics have contact of the second order at = ^2 = Oj and 
cross at X 2 = xz = 0. 


Exercises 

1 . List the canonical forms of pairs of quadratic forms in four variables.^ 

2. If A and N are any two n-rowed square matrices, there ejdsts an n-rowed 
non-singular matrix X such that X'AX = N (whence A and N are con- 
gruent or conjunctive) if and only if the pair A, A' is equivalent - to the pair 
Nj iV'. Hints: If they are equivalent, there exist matrices P and Q such that 
PAQ — Xj PA'Q = iV'. Add and subtract and apply Theorem 13. 


65. Further applications of matrices to forms, available in 
English. Hurwitz’s identity expressing the product of a sum of p 
squares and a sum of n squares as a sum of n squares.^ 


^Bromwich, Quadratic Forms, 1906, 46, or Hilton, Homogeneous Linear Substi- 
tutions, 1914, 105. 

2 Hence the problem depends upon the singular case. Muth, Elementartheiier, 
1899, 144-51. Encyclop^die des sc. math., t. I, v. II, 471-4. 

®For p ~ n, Dickson, Annals of Math., (2), 20, 1919, 160-4. Hurwitz, Math. 
Amialen, 88, 1922, 1-25. Radon, Abhand. Math. Hamburg Univ., 1, 1921, 1-14, 
found the maximum p for a given n when the matrices are real. Generalization, to 
forms of any. order, Dickson, Congrfes Intemational des Math^maticiens, Stras- 
bourg, 1920, 215-30; Comptes Rendus Paris, 172, 1921, 636-40, 1262. 
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Forms expressible as determinants with linear elements.^ 

Frobenius’s theory of matrices ’whose elements are pol^momials 
in one variable; application to the equivalence of pairs of bilinear 
forms.^ 

Equivalence of two quadratic or Hermitian forms in a general 
field.® 

Singular case of pairs of bilinear, quadratic, or Hermitian forms.^ 

Weierstrass^ gave a simple application of the theory of pairs of 
bilinear forms to the integration of a system of n linear differential 
equations of the first order in n variables with constant coefficients. 

For applications of elementary divisors to linear differential 
equations, see texts on the latter subject. 


1 Dickson, Trans. Amer. Math, Soc , 22, 1921, 167-79 (24, 1922, 1S5; 26, 1926, 
367); Amer. Jour. Math., 43, 1921, 102-34; Annals of Math., 23, 1921, 70-74. 

- Bocher, Introduction to Higher Algebra, 1907, 262-284. Cf. Dickson, Algebras 
and their Arithmetics, 1923, 169-174; revised in the German edition, Zurich, 1926. 

^Dickson, Bull. Amer. Math. Soc., 14, 1907-8, lOS-115 (simpKfication of Trans. 
Amer. Math. Soc., 7, 1906, 275-292); Quar. Jour. Math., 39, 190S, 316-33. 

^Kronecker, Berlin. Sitzungsberichte, 1890, 1225-37, 1375-88; 1891, 9-17, 33-44. 
Simpler treatment by rational methods, Dickson, Trans. Amer. Math. Soc., 1927 
(10, 1909, 358-60). 

® Berlin. Abhandl., 1875; Werke, II, 75-76. 
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FIRST PRINCIPLES OF GROUPS OF SUBSTITUTIONS 


The fundamental concepts of substitutions and groups will be 
introduced in a very concrete and natural way in connection with 
the solution of cubic and quartic equations. The reader will there- 
fore appreciate from the start some of the reasons why these 
concepts are employed. The deliberate presentation and numer- 
ous illustrative examples will enable the reader to digest these 
somewhat abstract ideas. 


66. Cubic equations. Of various methods of solving 
(1) + bx^ + cx + d = 0 


we shall present the method which best illustrates our later theory 
of the solution of equations of any degree. If Xg, Xs are the 
roots, then 


( 2 ) 


J^Xi^ Xi + X 2 + Xz = —h, 

X1X2 ^ X1X2 + XiXz + X2Xz = C, X1X2XZ = — d. 


Let CO be an imaginary cube root of unity, so that 
(3) CO^ + CO + 1 = 0, CO® = 1. 


As soon Bs we have computed the values of 
(4) <j> = Xi COX2 + ^ ^ Xl 03^ X 2 COX3, 


we can find the x^s by solving these linear equations with the first 
one of (2). First, we add the three equations and get the value 
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of Next, we multiply them by 1 , respectively, and add. 
Finally, we multiply them by a?, or, 1 and add. Using (3), we get 

^ - 5), % = |(a>- ^ — 6 ), 

(5) 

^3 = I ( co<^ + a>“ ^ — 6). 


To compute the values of 4> and we employ 


( 6 ) 


~ "h + cr) X/ ^2 

= ( XI ^ ^2 = — 3c, 


^3 ^ ^3 — 2 X] ~ 3 23 + i2xi X2 xz 

= 2 (X) 9X3 ' 13 ^2 + 273?! X2X3^ B, 

^ £ = - 26^ + 96c - 27d, 

(^3 - ^3)2 == (^3 + ^3)2 _ 4^3 ^3 = ^2 _ 443 , 

Hence is equal to one of the square roots of B- — 4A®. 

Choosing a particular square root and emplojdng also (7), we get 

<i>^ = mb + (b^ - = g, 

^ ^ == mb - (B 2 -* 413)^] == A. 

The cube roots ^ and ^ of g and h must be chosen so that 
^ by ( 6 ). We agree to obtain ^ as a cube root of g and then 
compute ^ as the quotient A/ <l>. 

If ^ is a particular cube root of g, the remaining cube roots are 
o)4> and o}^4»- We therefore consider the functions 


(9) 


(j>l = oxff = Xs + 03Xi + 0)^ X2, 
\pl = CO^ = Xz + 0}X2 + Xl, 


( 10 ) 


(j)2 03^ 4> = X2 A" <^Xz + Xl, 

^2 = 03^ = X2 -jr 03Xi + 03^ Xz. 
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The replacement of ^ and fin (o) by oof and orf, respectively, 
has the effect of replacing Xi by Xsj x-i by Xi^ and Xs by Xa. This 
operation is called a substitution on the roots (or letters ) Xi, xa, X3, 
and is denoted by (xi X3 Xa). Such a cycle signifies that each letter 
is replaced by the letter written just to the right of it, while the 
last letter Xa of the cycle is replaced by the first letter Xi. 

Similarly, the replacement of f and ^ in (5) by 00^ f and 
oof respectively has the effect of applying the substitution 

(Xi X2 X3). 

If we had chosen the other square root of — 4A^, we would 
have obtained — hj f^ = g, instead of (8), The interchange 
of f and f m (5) has the effect of applying the substitution 
(X2X3) which interchanges Xo with X3 and leaves xi unaltered. 
The replacement of f by oof and f by oo^f in (5) has the effect 
of applying the substitution (xi Xa). Finally, the replacement of 
f by 00 ^ f and f hj oof in (5) has the effect of applying the sub- 
stitution (X1X3). 

We may combine the essential parts of these results as follows: 
If we have found a definite value of f (and hence of f = A/ f) hy 
making a choice of the square root in (8) and then a choice of the 
cube root of the resulting g, so that formulas (5) give definite 
values for Xi, Xa, Xs, then a different choice of the square root or 
of the cube root leads to new values of Xi, Xa, X3 which are derived 
from the former values by appl3rmg a substitution on the roots. 
This conclusion might have been anticipated, since if we start 
with the coefficients of the cubic equation and compute the roots, 
we are at liberty to assign the notations Xi, X2, X3 to the roots 
arranged in any order. 


Exercise 

Employing the linear functions (4), (9), (10), verify that the substitutions 
(xi xz X 2 ) and (xi X 2 xz) replace f and f by fi and fij and fz and fz, respec- 
tively, while (xz xa) interchanges f and f, (xi xz) interchanges 4>i and fi, and 
{xi Xz) interchanges fz and Also, (xi xz) replaces <!> and f by ft and 
while (xi Xs) replaces f and f by fi and fa respectively. 
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67. Discriminant of a cubic equation. The discriminant A of 
a cubic equation in which the coefficient of is unity is defined 
to be the product of the squares of the differences of the roots. 
We shall compute it by means of the value — M® of i<f — 

in § 66 . Since the cube roots of unity are oo, and we have 

— 1 = (2 ~ 1 ) (2 — O)) (2 — 03^) 

for all values of z. Taking z = we see that 

^3 — ^3 (6 — — O)’^) (<j) — 0)^ }}/). 

From (4), (9), and ( 10 ), we get 

<f> — ^ - (w — 0)2) (a;2 — Xz), 

(11) 0 — coi/' = (1 — w) {xi — a^), 

0 — ^ = (1 — (^1 — 2 : 3 ). 

We desire the product of the squares of the three expressions 

(11) . By use of (3), we get 

(a> -- oo^y = - 3, (1 - co) (1 - 03^) = 3. 

Hence — 4J.® = — 27 A, This gives 

(12) A == mhcd + - 27(P, 


Exercises 

1. Tbe discriminant ofa:2 + ca;4-d = 0is —40^— 27d^. 

2. Show that — 27a; -f- 54 = 0 has a multiple root by computing its 
discriminant. 
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3. If we define the discriminant A of ax^ + + cx + d = 0 to be 

Pj where P is the product of the squares of the differences of the roots, so 

that P is derived from (12) by replacing 6, c, d by 6/a, c/a, d/a, respectively, 
show that A — 18a6cd — 46® d — 4ac® — 27a‘^ c^. 

4. If I) is the discriminant of the corresponding cubic form axfi 4- y + 
csyf + dip, show by (19) of Ch. I that A = — 27D. 

5. If A 0 and ps 0 (so that 0, ^ ps 0), verify by using (11 ) that 

the six functions (4), (9 ), and (10) have six distinct values. In the Exercise in 
§66, we gave a substitution which replaces by any one of the remaining 
five functions. The identity substitution which replaces each root x^ by itself 
does not of course alter <f>. Since therefore the six substitutions on the three 
roots replace <f> by six functions whose values are aB distinct, is caled a 
six-valued function. 

6. Show that also is a six-valued function. Why does it follow without 
computation that each of the four functions (9) and (10) is six-valued? 


68. Substitutions on n letters. The operation s which replaces 
Xi by Xa, X 2 by Xb, , , . , Xn by Xz, where a, 6, . . . , ^ form a 
permutation of 1, 2, . . . , n, is called a substitution on XijX-j . . . , 
Xn. It is given the two-rowed notation 


(13) 


/ Xi X2 - ’ Xn\ ^ r 12 - • n\ 
\XaXh • ' ' Xi / \ab ‘ — I P 


where in the second form only the subscripts are written. Since 
the order of the columns is immaterial, we may permute them in 
any manner without changing the meaning of s. 

To each permutation a, 6, . . . , Z of 1, 2, . . . , n corresponds one 
and only one substitution. Hence the number of distinct substitu- 
tions on n letters is nl = n(n — 1) * • • 3 *2 -1. 

Instead of the two-rowed notation in (13), we often employ the 
one-rowed notation of cycles ( §66). The relation between the 
two notations will be clear from the following substitutions: 


/I 2 3 4 5 
\2 3 1 4 5 


) = (123), 


(23 
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69. Product of substitutions. Given a substitution § in (13 ) and 
another substitution t, we may permute the columns of ^ in such a 
way that the letters in its upper row are the same as the corre- 
sponding letters in the lower row of s. Then if 

‘-cr.-.D’ 

the effect of appljdng first s and afterwards t is the same as the 
effect of applying the single substitution 

which is called the 'product of 5 by t We write p = st The word 
product is used here in the sense of resultant or compound. 

We may, of course, find the product of substitutions each ex- 
pressed in cycles. For example, 

(12) (34) *(23) == (1342), (23) • (12) (34) - (1243). 

To find the first product, note that the first factor replaces 1 by 
2 and the second factor replaces 2 by 3, whence the product 
replaces 1 by 3; etc. Since the two products of our substitutions 
taken in different orders are distinct, we see that multiplication of 
substitutions is not always commutative. 

When a substitution is composed of two or more cycles (on 
different letters), it may be regarded as a product of commutative 
cyclic substitutions each composed of a single cycle. For example, 
(12) (34) = (12). (34) = (34) *(12). 

Multiplication of substitutions is associative: st'V === s 4v, so 
that the notation stv is unambiguous. Let s, and the product 
st ^ p have the notations in (13), (14), (15). Let 


fa ^ ••• X\ 
^ jS ••• L/' 
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/ ab 

■■1 \ 

, / 12 • 

• • w \ 

\AB ■ 

■ ■ lJ’ 


. . i) 


From this result we readily deduce the associative law for prod- 
ucts of four substitutions a, h, c,d: 


a • bed = a\b * cd] ^ db- cd = [ah • c]d = ahe • d, 


so that the notation abed is unambiguous. The associative law 
for products of any number h of substitutions is readily proved 
similarly by induction on fc. Thus there is a single product of 
k substitutions Ui, . . . , taken in a fixed order, and the nota- 
tion ui Os • • • Ufe is unambiguous. In particular, if the k factors 
are all equal to a, their product is denoted by a* without ambi- 
guity and is called the ^th power of a, 

70. Identity, order, inverse. The substitution which leaves 
unaltered each of the n letters is called the identity and denoted 
by I. 

A substitution p is said to be of order (or period) A; if it is the 
least positive integer such that = I, For example, (12) and 
(12) (34) are of order 2. 

If p = (123 . . ,r),p^ replaces 1 by 3, replaces 1 by 4, , 

and fori < replaces 1 by j + 1, so that the order of p is not 
less than r. Moreover, p^ == I. Hence the order of a single cycle 
on r letters is r. 

If a, 5, . . . are single cycles of r, . . . letters respectively, and 
if no two of these cycles have a letter in common, then [oh •••]*» 
a^b^ • • • will be the identity if and only if = I, 5^ = I, . . , , 
and hence if and only if fe is a common multiple of r, a, ... . 
Hence the order of any substitution is the least common multiple 
of the orders (numbers of letters) of its component cycles^ For 
example, (123) (45) is of order 6, while (12) (3456) is of order 4. 
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Every substitution s has an inverse such that = J, 
5 = J. This follows from 


(16) 


/I 2 • 

’ - n\ 

1 / o 6 • ■ 


\ab • 

■ ■ih 

® “ (l 2 • 

.. J 


Exercises 

1. If s ~ (123), verify that ^ = (132), s* == /, = s®. 

2. If s is of order k, s“^ = 

3. A cycle is not altered by a cyclic permutation of its letters. 

4. If af ~ sr, then t ^ r. 

5 The inverse of (123 • • • r) is (r r — 1 • « • 321). 

7L Quartic equations. To solve 

(17) -{-hoi? + cx^ + dx + e 0, 

transpose the last three terms and complete the square on the 
first two; we get 

{x^ + \hxy = — c)x^ — dx — e. 

The device of Ferrari (1522-1565) consists in deriving an equa- 
tion in which also the second member wiU be a perfect square. 
This is accomplished by adding {o? + ^hx)y + to each mem- 
ber; we get 

(18) (x^ + 46x + = (ih^ - c + y)x? + iiby - d)x-\-ly^ - e. 


Denote the second member by q = rx^ + sx + ^. If it is the 
square of mx + n, then tt? = r, 2mn = == whence 

^ = 4rt. Conversely, let the last condition be satisfied. If r = 0, 
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then s = 0 and g is a constant and hence a square. If r 5^ 0^ g is 
the square of x + Hence rx^ + so; + ^ is the square 

of a linear function of x if and only if = 4rf. Appl3dng this 
result to the second member of (18), we see that it is a perfect 
square if and only if 

(19) — cy^ + (bd — 4e)y + 4ce — e — = 0. 

If yi is a root of this resolvent cubic equation, (18) implies 


+ ibx + |yi == mx + n 
or x^ + |6x + lyi = — mx — n. 

Let Xi and X 2 be the roots of the first of these quadratic equa- 
tions, and xs and X4 be the roots of the second- Then Xi, . . . , X4 
are the desired roots of (17). Note that 

xi X 2 = iyi - n, xz xa = ^yi + n, xi X2 + xs X4 = yi- 

If instead of yi we employ another root ya or yz of the resolvent 
cubic (19), we obtain two quadratic equations different from (20), 
such that their four roots are Xi, X2, X3, Xi paired in a new manner. 
The root which is paired with Xi is now xs or X4. Hence the roots 
of the resolvent cubic (19) are 

( 21 ) = xi X2 + X3 X4, 2/2 = 3^1 X3 + Xa, yz = XiXa + X 2 xs. 

The discriminant of (17) is defined to be the product of the 
squares of the differences of its roots. By (21), 


( 22 ) 


Vl- y2 = (Xi — X4) (X2 - X3), 

yi - Vs = (xi - X3) (x2 - X4), 

y 2 - yt = (xi — X2) (x3 - X4). 
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Taking the product of the squares, we see that the discriminant of 
a guartic equation (17) is equal to the discriminant of its resolvent 
cubic (19). 


Exercises 

1 Tke resolvent cubic of -f 1 = 0 is z/® — 4z/ = 0. Hence -h 1 has 
the pairs of factors ± i, x® ± 2^ x -j- 1, x* ± ( — 2)^ x — 1. 

2. The discriminant of x^ — 8x* -f 22x* — 24x + 9 = 0 is zero, so that 
the equation has a multiple root. 

3. If d = 6, e = 1, so that (17) is a reciprocal equation, the resolvent cubic 
is (2/ — 2)[i/2 + (2 — c)z/ + 5-2 — 2c] = 0 The fact that 2 is a value of one 
of the t/’s in (21 ) follows directly from the fact that the roots are reciprocal in 
pairs. In particular, taking y = 2, we find that x^ + ^+x^+x-pl has the 
factors x^ 4 ^(1 ± B)x -\- 1. 


72. Groups of substitutions, k set of m distinct substitutions 
is called a group of order m if every product of two of them and the 
square of each of them are substitutions of the set. 

For example, the n\ substitutions on n letters form a group, 
called the symmetric group on n letters, since the product of any 
two substitutions on n letters and the square of any one of them 
are substitutions on n letters. Again, if s is a substitution of order 
m, the substitutions I, s, s^, form a cyclic group of order 

generated by s. The iderdity group is composed of the single 
substitution I. 

If all the substitutions of a group belong to another group, the 
former is called a subgroup of the latter. 

As a generalization, we shaR next define a group whose elements need not 
be substitutions. An abstract group is a system composed of a set of elements 
. . . and a rule of combining any two of them to produce their “product, 
such that (i) every product of two of the elements and the square of each 
element are elements of the set, (ii) the associative law holds, (iii) the set 
contains an identity element I such that la = al = a for every element a 
of the set, and (iv) each element a of the set has an inverse belonging to 
the set, such that aa“^ = a = /. 
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In our definition of a group of substitutions, we employed assumption (i ) 
alone The associative law holds ( §69 ) If a is a substitution of order m of the 
set, then — I and belong to the set. Hence for sets of sub- 

stitutions, properties (lii) and (iv) follow from (i) and (ii). 

73. Group leaving a function invariant. Let , X 4 be dis- 
tinct. By (22), the three functions = X 1 X 2 + Xz X 4 ,, — XiXz + 

X 2 X 4 f yz — ^ 3 : 1 X 4 + X 2 xs are distinct. Since ^2 and yz are the only 
other functions of the same form as ^ 1 , it is to be anticipated that 
each of the 24 substitutions on Xi, x^, xs, X 4 replaces yi by yi, or yz 
and hence that |24 or 8 of them leave yi unaltered. To verify 
these statements, note that a = (12) and h = (13) (24) leave 
X 1 X 2 + Xz X 4 unaltered. The same is therefore true of the products 

(1423), d -= ba -= (1324), e = ca = (14)(23), 

6 c =(12) (34), a -be =(34). 

Hence yi is unaltered by the eight substitutions of the set 

Gs= {I, (12), (34), (12) (34), 

(13) (24), (14) (23), (1423), (1324)}. 

Evidently substitution (23) replaces yi by 2 / 2 - Hence the eight 
distinct products of the eight substitutions of Gb by (23) all re- 
place 2/1 by yz- Similarly, the eight distinct products of the eight 
substitutions of 0$ by (24) aU replace yi by yz. There is no sub- 
stitution in common with two of these three sets of eight sub- 
stitutions, since they replace 2/1 by yi, yz, yz, respectively. Hence 
together they give all of the 24 substitutions on xi, xz, Xs, X 4 - We 
conclude that every substitution, on the x's replaces yi by 2 / 1 , 2 / 2 , 
or yz, so that 2/1 is a three-valued function. Moreover, we con- 
clude that Gs is composed of all the substitutions on the x’s which 
leave yi unaltered. Hence every product of two equal or distinct 
substitutions of Gb is equal to a substitution of Gb, so that Gg is a 
group of order 8. 
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We shall say that the function XiXo + Xs X4. belongs to the group 
(?8 since it remains unaltered by all the substitutions of Gs and is 
altered by all further substitutions on Xi, X2j Xg, X4. 

We saw that all eight substitutions of Gs may be obtained by 
successive multiplications starting with a and 6. Hence the 
group Gb is said to be generated by a and 6. 


Exercises 

1. Since = xi rcs + xz X 4 is derived from yi by interchanging and 
the group Hs to which 2/2 belongs is derived from G& by interchanging 2 and 3 : 

Hs = {/, (13), (24), (13) (24), (12) (34), (14) (23), (1432), (1234)}. 

2. By interchanging 2 and 4 in Us, obtain the group 

Ks = !J, (14), (23), (14) (23), (13) (24), (12) (34), (1243), (1342)} 
to which 2/3 - X 1 X 4 -r Xi xz belongs. Find two generators of this group 

3 For t = xi + X2 ^ Xz — X4, we have 4 :yi + — 4c. Hence a sub- 

stitution leaving t unaltered leaves also yi unaltered. Show that t belongs to 
the subgroup (/, (12), (34), (12) (34)} of Ug. 

4 The six functions obtained from 4 > — yi uy^ o^yz by applying the 
six substitutions on yi, 2 / 2 , ys are all distinct if . . . , X 4 are independent vari- 
ables Then 4> belongs to the group G composed of the substitutions on the 
x’s which leave 2 / 1 , 2 / 2 , 2/3 simultaneously unaltered. Hence G is the greatest 
common subgroup of (xg, Hg, iTg, so that 

G = fJ, (12) (34), (13) (24), (14) (23)}. 

Verify also that (j is a commutative group. 

5. For three distinct x^s, (xi — X 2 )(xi — X 3 )C% — Xg) belongs to the 
group j J, a = (123 ), (132 ) } . Verify also that this is a cyclic group generated 
by a. 


74 . Altematmg group. A transposition is a substitution, like 
( 13 ), which interchanges two letters and leaves unaltered the 
further letters. Every sutistitution is a product of transpositions. 

^And in further cases. See Ex. 5 of §67. 
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For, it is a product of cycles on different letters, while a single 
cycle with n letters is a product of n — 1 transpositions: 

(123 --n-ln) =- (12) (13) • • * (1 - 1) (1 n). 

But the same substitution can be decomposed into transpositions 
in various ways. For example, 

(123) = (12) (13) = (13) (23) - (13) (23) • (45) (45). 

Of the various decompositions of a given substitution s into 
transpositions, either all contain an even number of transpositions 
(and s is then called an even or positive substitution), or all con- 
tain an odd number of transpositions (and s is called an odd or 
negative substitution). This is proved by using the alternating 
function 

P = (Xi- X2) (Xi - Xz) (Xi - X4) (Xi ~ Xn) 

• (X 2 - Xz) iX 2 - Xa) • • ‘ ( 0:2 Xn) 


* (Xfi—l Xn ) ) 

in which xi, . . . , Xn are assumed to be distinct- Any transposition 
(Xi Xj) with i < j merely changes the sign of P (see the preceding 
Ex. 5). For, the factors of P which involve neither Xi nor Xj are 
evidently unaltered. The factor Xi — Xj involving both is changed 
in sign. The remaining factors may be paired to form the products 

4; (x^ — xjb) (xj — Xk) [A = 1, . . . , n; fc 7 *^ i, k 9 ^ j]. 

Such a product is unaltered- Hence P is merely changed in sign. 

Hence if « is a product of an even number of transpositions, it 
leaves P unaltered. But if ^ is a product of an odd number of 
transpositions, it replaces P by — P. This proves the above abor- 
tion which justifies the terms even and odd substitutions. 
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All even substitutions on n letters therefore form a group, called 
the alternating group on n letters. It is the group to which the 
alternating function P belongs. 

75. Theorem. If a group G contains an odd substitution t, exactly 
half of the substitutions of G are odd. 

For, the products of its even substitutions by t are distinct and 
odd, so that G contains at least as many odd substitutions as 
even ones. Again, the products of its odd substitutions by t are 
distinct and even, so that G contains at least as many even sub- 
stitutions as odd ones. The two results show that G contains 
exactly as many odd as even substitutions. 

It follows that the alternating group on n letters is of order 
I -n!. Since it is a subgroup of the symmetric group of order nl, 
we have an illustration of the next theorem. 

76. Theorem. The order of any group is a multiple of the order 
of any subgroup. 

Let a group G of order r have a subgroup H of order s < r, so 
that G contains a substitution g 2 not in H. Write Hg^ for the set 
of 5 substitutions obtained by multiplying each substitution of 
H by g 2 . If a substitution hg^ of Hg^ were equal to a substitution 
A' of then = h~^ h' would belong to the group H, contrary to 
h 3 rpothesis. Hence there are 2s distinct substitutions in the set 
H + Hg 2 ) which denotes the aggregate of the sets H and Hg^, 
If r = 2^, the theorem is proved. If r > 2s, G contains a sub- 
stitution gs not in i? or Hgt, As before, no substitution of Hgz 
belongs to If a substitution hgz of Hgz were equal to a sub- 
stitution h' g 2 of Hg^j then gz = A' gz = hi gs would belong to 
Hg 2 , contrary to hypothesis. Hence there are Zs distinct sub- 
stitutions in the set H + Hg^ + Hgz. Unless the latter coincides 
with G, we repeat the argument using a new substitution g 4 . 

If (? is a group on n letters, its order does not exceed ^^L Hence 
the above process terminates and gives 
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G^H + Hg, + Hg,+ • • * + Eg., 

Thus G is of order r = su. This integer u is called the index of H 
under G, 

Exercises 

1. The order of any group on n letters is a divisor of n !. 

2. The order of any substitution of a group G divides the order of G. 

3. Hence a group of prime order is cyclic. 

4. The alternating group on 4 letters is composed of /, (12) (34), (13) (24), 
(14) (23), and the eight cycles of three letters. 

5. All the substitutions common to two groups form a subgroup of each. 



Chaptee '^T^II 

MELDS, EEDUCIBLE AND IRREDUCIBLE FUNCTIONS 

This chapter presents certain concepts and elementary theorems 
which will be needed in the Galois theory of equations. 

77. Fields, adjtinction. A set of complex numbers is called a 
number field if the sum, difference, product, and quotient (except 
by zero) of any two equal or distinct numbers of the set are them- 
selves numbers of the set. Examples were given in §28. 

All rational functions of one or more variables with coefficients 
in a number field form a function field. In the Galois theory of 
equations, we employ a field 22(&i, . , . , composed of all 
rational functions with rational coefficients of h, , km, either 
all of which are constants, or certain of which are constants and 
the others are given functions of specified variables not neces- 
sarily independent. For example, J2(3^) is composed of all 
numbers 

a + bZ^ 
c + dS^’ 

in which a, 6, c, d are rational numbers and c and d are not 
both zero. Multiplying numerator and denominator by c — dZ^, 
we see that R(Z^) is composed of all numbers e + fZ^ in which 
e Bud f are rational. Hence iJ(3^ a;) is composed of all rational 
functions of x whose coefficients are of the form e + fZ^ with 
e and / rational. 

The elements, whether numbers or functions, of a field are con- 
veniently called quantities. 

Although the set composed of the number zero alone satisfies 
the above definition of a number field, we shall exclude it and 
hence agree that every field F contains a quantity q 9^ 0. Hence 
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F contains q/q = 1, 1 + 1 = 2, etc., and therefore contains all 
rational numbers. In other words, the field R of all rational 
numbers is a subfield of every field F. The foregoing field 
R (kij . . . , fcm) is said to be derived from R by the adjunction of 
ifci, . . . , few to R. If n < m, R{ki, . , • , kj^) is derived from 
R{ki, . . . , by the adjunction of k^^i, , , , jk^. 


Exercises 

1. If = — 1, every number of R{i) can be expressed in the form e + /i, 
where e and / are rational numbers. 

2. R{Z^) = F(12i). 

3. Write r = 3^ f + § r. Then R{r) = R ( 03 ) ^ R (6>®). 

4. If s = f + 2i, R(i, 21) = R(s). Hint: i — 21 = — 3/s 

5. In Ex. 4, jK (s) is derived from R{i) by adjoining 21 

78. Greatest common divisor. Let f(x) and g{x) be poly- 
nomials in X with coefficients in a field F, such that g{x) is not 
identically zero. Let the division of f{x) hy g(x) yield a quotient 
a(x) and a remainder r(x) whose degree is less than that of 
g(x). Similarly, let the division of ^ by r yield a quotient 6 and 
a remainder s whose degree in a: is less than that of r. Proceeding 
in this manner, let 

f = ga + r, g = rb + s, r = sc + s = 

where the fourth remainder has been assumed to be identically 
zero, to simplify the notations. Since a, r, 6, . . . were obtained by 
rational operations, aU of their coefficients are quantities of F, 

Any common divisor of f and g is seen in turn to be a divisor of 
r, a, and t By employing the equations in reverse order, we see 
that i is a divisor of s, r, g, and/. Since t is a divisor of / and g and 
since any common divisor of them is a divisor of we call t a 
greatest common divisor of / and g. It is determined uniquely apart 
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from a factor belonging to F; in particular, there is a single greatest 
common divisor having unity as coefficient of the highest power 
of X. 

The displayed equations give in turn 

r f - ga, s = g(l + ah) - fb, 

i = /(I + he) — g{a + c + ahc). 

Theorem 1. Any two polynomials f(x) and g{x), not both 
identically zero with coeffijcients in a field F, ham a unique greatest 
common divisor d{x) whose leading coefiicient is unity. All of its 
coefficients belong to F, and there exist two polynomials u{x) and 
v{x) with coefficients in F such that 

d(x) = u(x)f(x)+ v{x)g(x). 

The preceding discussion applies also when / and g denote 
integers. Let g be positive. Then the division of f hj g yields 
an integral quotient a and an integral remainder r such that 
0'^ r < g. Hence / and g have a unique positive greatest com- 
mon divisor. In case it is unity, we call / and g relatively prime. 
Hence we have 

Theorem 2. If f and g are relatively prime integers^ there exist 
integers u and v such that uf + vg = 1. 

79. Roots of algebraic equations. Consider an equation 

(1) / (x) = + Cl + Cn = 0. 

When = 3 or 4, we found formulas in Ch. VII which express 
the n roots of (1) in terms of the coeflheients Ci. But when n > 4, 
no such formulas are known except for special equations like 
— c = 0. 
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What shall be meant by roots of (1 ) when the coefficients d are 
independent complex variables? In that case the method ( §78) 
of finding the greatest common divisor of f{x) and its derivative 
f {x) leads to a final remainder r(ci , . . . , Cn) which is free of x 
and is not zero identicaUy. For, if ^ — I, then 

/ (x) = m”-! and r = - 1. Let Ci, . . . , Cn be constant values 
of Cl, , Cn such that r(Ci, . . . , Cn) 0. Then the resulting 
function (1) has no factor in common with its derivative, and 
equation (1 ) has n distinct roots. Since the pol3aiomiaI r is a con- 
tinuous function of the c^s, we have r(ci, . . . , Cn) 9 ^ 0 for all sets 
of values of Ci, . . , , Cn for which each difference Ci — Ci is suffi- 
ciently small in absolute value. For each such set of values of 
Cl, — , Cn, equation (1) has n distinct roots Zi, . . . , Zn, and 
each Xi is known^ to vary continuously when the c^s vary con- 
tinuously. These numbers X, are therefore the values of a con- 
tinuous function Xi of Ci, . . . , Cn for those values of the c^s such 
that each difference Ci — Ct is sufficiently small in absolute value. 
These functions Xi, ,..,Xn are called the roots of (1), In brief, 
the root values may be assembled into root functions. 

If the Ci are not independent variables, but are functions of 
certain variables Vj such that r(ci, . . . , Cn) is not zero identically 
in the Vj, the preceding discussion remains valid if we consider 
only sets of values of Ci, , . . , Cn which are obtained by varying 
the continuously. 

80. Reducibility. A polynomial in one variable x with coeffi- 
cients in a field F is called veducihle in F if it can be expressed as a 
product of polynomials in x, neither a constant, with coefficients 
in F. It is irreducible in F ii no such factorization is possible. 

For example, — 4 is reducible and -■ 2 is irreducible in 
the field R of rational numbers. But — 2 is reducible in jffi (2^). 

According as /(a;) is reducible or irreducible in F, the equation 
/(x) = 0 is called reducible or irreducible in F. 

^Weber’s Lelirbucli der Algebra, ed. 2, 1, 1898, 148. Cf, CooMge, Anytala o! 
Math., (2), 9, 1908, 116-8. 
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Theorems. Letf(x) and g(x) be polynomials ‘with coeffideMs 
in afield F and let f(x) be ineducible in F. If (me root oc of f{x) = 0 
satisfies g(x) = 0, then f{x) is a divisor of g(x). 

The greatest common divisor d(x) of/(x) and g(x) is not a 
constant, since it has the factor x — a. By §78, the coefficients of 
d{x) belong to F. The quotient oif{x) by d{x) is a constant, 
since otherwise f{x) would not be irreducible in F. Hence 
d{x) = r/(x), where c is a quantity independent of x belonging 
to F, But d{x) divides g{x). Hence /(a:) divides gix). 


Exercises 

1 A root of an equation irreducible in P does not satisfy an equation of 
lower degree with coefficients in F 

2 An irreducible equation / (x ) = 0 has no multiple root. Hint : Consider 
fix) - 0 

3. Two equations each irreducible in F are identical if they have a com- 
mon root 

4. + 1 is reducible in any field which contains one of the four numbers 

2^, (— 2)^, ^ 1)^, p = (1 but is irreducible in all other 

fields. Hints: Its linear factors are x ± p and x ± p*"^, while its pairs of 
quadratic factors are ± % x^ ± 2ix 4- 1, a:® ± 2Hx — 1. 

81. Gauss’s Lemma. If a polynomial f (x) with integral coeffir 
dents j that of the highest power of x being unity, is the product of two 
polynomials with rational coefiudents, 

+ 6i + • * • + 6^, 

\p{x) = + Cl x’*-^ + • * • + Cn, 

then these coefficients are dll integers. 

Let the fractions 6i, . . . , 6^ be brought to their least positive 
common denominator Bq and write b^ = Bi/Ba. Then Bo, 
Bi, . . . , Bm are integers having no common integral divisor > 1. 
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Similarly, let c* = Ct/Co, where Co, Ci, . . . , Cn are integers hav- 
ing no common divisor > 1. The theorem is obinonsly true if 
Bq = 1, Co = 1. Next, let Bq Co > 1. Multipljnng the mem- 
bers of / = Bq Co, we get 

( 2 ) BoCQf(x)^gh, 

where 

g = Bqx^ + Bi x^-^ + ' ' + Bm, 
h = Co x^ + Cl x^-^ + • • • + C,^. 

Let p be a prime factor of Bo Co- Since p divides each coefficient 
of the left member of (2), it divides each coefficient of gh. Not all 
coefficients of g are divisible by p. Let B^ be the first coefficient of 
g which is not divisible by p. Let Ca be the first coefficient of h 
which is not divisible by p. The total coefficient of in 

gh is 

• * • -b Bx+ 2 Ck^2 + B^^l Ck-i + Bi,Ck 

Ck+l + 5t~2 Ca4.2 + • • • 


Since S^_i, Bi ^^ , . . . , and Ca„i, Cfc^ 2 , . . . , Co are all divisible 
by Pj while jB^ Ca is not, the preceding sum is not divisible by p^ 
contrary to the above. 


Exercises 

1. Gauss’s Lemma for the case m = 1 shows that if an equation s’" 4- 
«i + • * * + ar == 0 with integral coefficients, that of the highest power of 
X being unity, has a rational root, that root is an integer. 

2. An integral root of the equation in Ex. 1 is an exact divisor of the con- 
stant term Ur- Hint: It divides all terms except the last 

3. Show that — 2, x® — 3x -f 1, and — 7a; 4- 7 are all irreducible in the 
field R of rational numbers. Hint: If such a cubic function were reducible in 
JS, it would be the product of a linear and a quadratic function with coeffi- 
cients in R, Apply the theorems in Exs. 1, 2. 
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82. Irreducibility of A. 

Theorem 4. If p is a prime and if A is a quantity of a field F 
such that A is not the pth power of any quantity of F, then — A is 
irreducible in F, 

An imaginar j pth root of unity is given by 
p = cos 2t/p + i sin 2'r/p, 

since = 1 by De Moivre's theorem. By the same theorem, 
p^ 1 if 0 < Z < p. Hence 1, p, p^, . . . , p^”^ are aU distinct and 
give all the pth roots of unity. Thus if r denotes one root of 
= A, all its roots are given by 

(3) r, pr, p2r, . . . , pP-^r. 

Suppose that — A has a factor f{x) of degree t <p with 
coefficients in F, that of x^ being unity. The product of the roots 
of /(rc) = 0 is, apart from sign, equal to the constant term. 
Hence p« belongs to F. By Theorem 2, there exist integers y 
and z such that ty — pz ^ L Hence F contains 

(p« = p82/ ^PZ+I = pSy y = yf 

where r' is one of the roots (3). Since is in F, also r' is in F. 
In other words, A is the pth power of a quantity r' of F, contrary 
to hypothesis. This contradiction proves that — A is irreducible 
in F. 

The theorem is illustrated by Ex. 3 of §81. 

83. Irreducibility of the cyclotomic equation. If p is a prime, 

the equation 

— 1 
X — 1 
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has as its roots all the imaginary pth roots of unity. As a general- 
ization, the equation 

^ + = 0 (i = p*-i) 

^ X 

has as its roots all the primitive p*th roots of unity ^ viz., roots of 
= 1 which do not satisfy = 1. 

Let p be an arbitrarily chosen primitive p^th root of unity. 
Then p^ p^^", . . . , p^^" = 1 give all the t roots of = 1. The 
remaining e = tp ^ t powers of p, with positive exponents 
at, U 2 , . . . , Ue less than tp and not divisible by p, are roots of 
f(x)=0 and give all its roots. 

For example, if p* = 9, then p^, p% give the roots of = 1, 
while p, p2, p^, p^j p^j p8 are the roots of x®4-x^ + l = 0 and 
are the primitive ninth roots of unity. 

Suppose that f{x) is reducible in the field of rational numbers. 
Then by Gausses Lemma, /(a?) is the product of two poljmomials 
4>{x) and neither a constant, ha\dng integral coefiScients, 

such that each has unity as the coefficient of the highest power of 
X, Since /(I) = p, we have p = <^>(1) Since p is a prime, 

one of these integers, say ^(1), has the value ±1. At least one of 
the roots p“h . . . , p""® of /(a;) = 0 is a root of 4>{x) = 0; hence 


* •* == 0 . 

Here p is an arbitrarily chosen primitive p*th root of unity and 
hence is any one of the roots of f{x) = 0. This proves that the 
function 

P{x) ^ - 

vanishes when x is replaced by any one of the roots of f{x) == 0 
(which has no multiple root). Hence 


Pix) ^f{x)^q{x)j 
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where the polynomial quotient q(x) has integral coefficients since 
the leading coefficient of the divisor fix) is unity (or by use of 
Gauss’s Lemma), Taking x = 1 and recalling that /(I) = p, we 
get 

[4>(l)]^ -(± 1 )^ 

where q(l) is an integer. Since this is impossible, we have the 
following 

Theorem 5, The cyclotomic equation f{x)=0 whose roots are 
the primitive pHh roots of unity, where p is a prime, is irreducible in 
the field R of rational numbers. 

Exercises 

1. For p- = 3, f{x) For = 2% f(x) ^ + 1. Verify 

that these quadratic functions are irreducible in R. 

2. For p* = 5, /(x) = ar* -f h a;* + a? + 1. Verify that it is irreducible 

in jR by discussing /(a:) « (x^ aa; r) (^s + r*"!). 



Chapter IX 

GROUP OP AN EQUATION FOR A GIVEN FIELD 

84. introduction. With a given algebraic equation we shall 
associate a certain group G of substitutions on its roots. The 
theory was initiated by E. Galois/ who was killed in a duel in 1832 
at the age of 21. In a later chapter we shall prove that the equa- 
tion solvable by radicals if and only if G is a group of the kind 
called solvable and shall conclude that the general equation of 
degree five or more is not solvable by radicals. In another chapter 
we shall apply our theory to various questions ignored in elemen- 
tary geometry and, for example, prove that it is not possible to 
trisect every angle with ruler and compasses. 

85. Equal functions of the roots. Consider an equation 

(1) f(x) ^ + Cl + . . . = 0 

whose coefficients are functions of certain variables Vf, In § 79 
we explained the sense in which (1) has n roots which are func- 
tions of the Vj, 

Two polynomials <l> and in the roots Xij , . . ^ Xn are caled 
eqiial if they have the same value for all sets of values of Ci, . . . , 
employed there in defining the roots Xi, In case <j> and ^ are 
rational functions of the Xi, we impose the further restriction on the 
Ci that the denominators of ^ and 4^ shall not be zero. 

We shall call <j> and -4 distinct if they are not equal. 

For example, consider the reciprocal quartic equation 

(2) + ax^ + hx? + ox -f- 1 = 0, 

1 Jour, de Math., 11, 1846, 381-444; CEuvxes, 1897; Manuscrits de Galois, 1908. 
For sources on Galoises life, see Bull. Amer, Math. Soc., 4, 1897-8, 332; Scientific 
Monthly, 1921, 363-75. 
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where a and h are independent complex variables. Since the 
reciprocal of any root is also a root, we may choose the notations 
of the roots so that xix^ = 1, Xz x^l = 1. Hence the two functions 
Xi Qih and Xz are equal. Thus the rational functions xi/xz and 
Xifx 2 are equal. Again, for 


t — xi + X2 — Xz -- Xi, yi - X 1 X 2 + Xz x^, 

we have 

«"= (Z a;i)= - 413 xi% + 42/1. 

Hence the functions and — 46 4- 4yi are equal. 

If the function 4/ is derived from ^ by applying the substitution 
5 on Xi, . . . j Xn and if ^ and 4) are equal in the foregoing sense, we 
shall say that <?S is unaltered by the substitution s. 

For the quartic {2), Xi xz is unaltered by the two substitutions 
(13) (24) and (1324), but is altered by (13). 

86. Function of the n roots with n! values. In our study of an 
equation /(x) = 0 of degree n, we may assume without real loss 
of generality that the n roots are distinct. Otherwise, /(x) and 
its derivative fix) would have a greatest common divisor g(x) 
not a constant, and f(x)/gix) = 0 has no multiple root and is 
satisfied by every root of /(x) = 0. Hence let/(x) = 0 have the 
distinct roots Xi, . . . , x^. We shall prove 

Theorem 1, There exist integers mi, , , , ,mn such that 

(3) Fi = mi Xi + m 2 X 2 + * • * + 

gives rise to nl distinct functions when the n\ substitutions s on 
xi , — ,Xn are applied to Fi. 

The subscript 1 indicates that we denote the identity substitu- 
tion by 1 instead of J. 

For ?2. = 2, we may evidently take Fi = xi. The case n = 3 
was discussed in Ex. 5, §67. 
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For any n > 1, we are to prove that we can choose integers 
mi^ . • . which satisfy no one of the 'p — n!(n! — l)/2 equa- 
tions Vs = V tj where s and t are distinct substitutions. If s is 
the identity and t = (xix^), the equation reduces to mi = m2, 
and this is the only one of our p equations which involves only 
nil and It is not satisfied if mi == 0 , m2 = 1 . 

Next, let s and t leave X4, . . . , Xn unaltered. If s and t replace 
xi, X2, xz by Xt, Xjj Xk and Xa, Xc, respectively, then Vs = V t 
becomes 

mlX^ + TThXj + mz Xk = mi Xo + x& -|- m3 x^- 

Equations of this type are the only ones of our p equations which 
involve only mi, ??^2, m3. Since the case in which m3 does not 
occur was treated before, we may assume that k 9^ c. For mi = 0 , 
TTiz — Ij the displayed equation reduces to 


Xk — Xc 


Hence we choose as m3 any integer which is distinct from each of 
these fractions, finite in number. 

Next, let s and t leave xs, . . . , Xn unaltered, but replace X4 by 
different roots. For mi == 0 , m2 = 1 , and for a chosen integral 
value of m3, Vs — V t expresses 2b fractional function of the 
roots. Choose as any integer which is distinct from each of 
these fractions. 

In the final step of the proof we employ substitutions s and t 
which alter x^, and choose m^ as an integer distinct from each of 
a finite number of fractional functions of the roots. 


Exercises 

1 . The roots of x^-}-x^ + x + l= 0 are xi = — 1 , xg = — i. 

Show that xa 4“ mg X3 is six-valued if ms is distinct from 0, 1, ± i, 1 ± i, 
i (1 ± i), and hence if m3 = — 1. Ihus also xg — xi is six-valued. 
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Denote tlie six substitutions on xi, xa, xz by 

J, a = (12), h = (13), c = (23), d - (123), e = (132). 

Compute tbe following functions and verify that they are distinct: 

7i = Xa ~ Xi = 1 4- i, Vb = X2 — xj = 2i, Ve = xz — xi = 1 — ij 

F. = -F.. 

2. If xi, . . . , x„ are independent variables, the function (3) has nl distinct 
values if mi, . . . , mn are any distinct constants. 

87. Galois resolvents. Let Si, . . . , Sr denote the substitutions 
on Xi, . . . , Xn, where r = nl and Si is the identity substitution. 
Let Vi be the r-valued function (3). If we apply Sy to Fi and then 
apply to the resulting function Vsj, we obtain Vs^ where 
Si = Sj si. When k is Jdxed, but j takes the values 1, 2, . . . , r, 
then I takes the same values in some new order. Hence sj, merely 
permutes Fi, . . . , amongst themselves. The elementary 
symmetric functions of these F^s are therefore sy mm etric func- 
tions of Xi, . . . , Xny and hence are polynomials in ci, Cn with 
integral coefficients. Let F be any field containing ci, . . . , Cn. 
Thus the coefficients of the polynomial defined by the expansion of 

(4) P(F) ^ (F - Fx)(F -- F^) • . . (F - F..) 

are quantities in P. 

If F(F) is reducible in P, let G{V) be that factor irreducible 
in P for which (t(Fi) = 0. But if P(F) is irreducible in P, take 
(?(F) to be P(F) itself. In either case, (r(F) = 0 is an equation 
with the root Fi whose coefficients belong to P and which is 
irreducible in P; it is called a Galois resolvent of equation (1 ) for 
the field P. 

The corresponding resolvent of the equation in Ex. 1 of §86 
for the field of rational numbers is evidently 

GiV) ^ (F - Fx)(F - Fe) ^ F^ -- 2F + 2 = 0; 


but for the field P(f), the r^lvent is F — Fi = 0. 
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Theorem 2* Let <p(xij . . , , Xn) be any polynomial, with coeffir 
denis in a field F, in the roots Xi of an equation with coefficients 
in F. Let s he any substitution on the roots and let it replace 4> by 
and Vi by where Vi is the nUvalued function (3) with in’- 
tegral coejfidents. Then 


( 5 ) 


P'(F.)’ 


where X is a polynomial with coefficients in F, while is the deriva- 
tive of the polynomial (4) whose coefficients belong to F, whence 
P"(Vs) ^ 0. Thus <f>a is the same rational function p(Fa) of F® 
that 4) ^ of Fi. 

Let r — n\, I = Sr* Evidently Sh permutes . . . , in the 
same manner that it permutes Fx, . . . , Vi. Hence the fractions in 


( 6 ) 


X(F) = <^>l 


P(V) 

F -- Fi 




P(V) 

V- Vi 


are merely permuted amongst themselves by any substitution on 
oji, . . . , Xn- Replacing P(F) by the product (4), we see that 
X(F) becomes a polynomial in F whose coejBSicients are integral 
rational symmetric functions of Xi, . , . , Xn with coefficients in 
F, and hence are equal to quantities in F, For F = F® each 
fraction in (6) becomes zero except that with the denominator 
F — F®, If in the quotient of the product (4) by F — F« we 
take F = V», we get the value of the derivative of (4) for F = F®. 
Hence 

X(F®) - <#>®P'(F®). 


In case the degree of the Galois resolvent GiV) = 0 is less than nl, we may 
express the numerator and denominator of = X(yi)/P^(Fi) as polyno- 
mials of lower degree in Fi by means of G{Vi) = 0. Let r (Fi) be the resulting 
fractional expression for It does not now follow (as in the last part of the 
theorem) that we can apply any substitution s to the members of the relation 
^ = r(Fi) and obtain a correct relation = r(F®). To obtain the latter 
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we would need to reduce the fraction (5) by means of G(Vg) = 0, which 
however is not known to hold unless s is one of the special substitutions for 
which Vg is a root of G(V) =0. For the remaining substitutions a, Vg is a 
root of a factor, other than of P(F) in (4). This discussion brings us 

to a critical point in our theory and indicates the care which must be taken in 
its development 


88. The group of an equation for a given field. Let jP be a 
field containing the coefficients of an equation (1) whose roots 
Xij , . , jXn are distinct. Let the roots of a Galois resolvent 
G{V) = 0 of degree g be 

(7) Fi, F,, F,, . . . , F„ 

in which the subscripts denote the substitutions on , Xn by 

which these F’s are derived from Fi. 

Theorem 3. These g substitutions 

(8) l,a,b,...,p 

form a group G, called the group of the given equation (1) for the 
field F. 

We are to prove that the product rs of any two equal or distinct 
substitutions (8) is one of those substitutions. Take F, as the 
function (^> in Theorem 2. Then 


Vr 


HVi) 

P'(Fi)’ 


Vr. = (Vr). 


hjV.) 

P'iV.y 


where X(F) and P'(V) are polynomials in F with coefficients in 
F. Since F, is a root of G(F) = 0, the equation 
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is satisfied when V = Vi. The product of the left member by the 
gih. pow'er of P' (F) is a polynomial H{V) in V which vanishes for 
V = IT and has quantities in F as coefficients. Hence ( §80) the 
root Fs of the equation (?(F) = 0 irreducible in P is a root of 
H (F) = 0. Since P'{Vs) ^ 0, we may divide H (F^,) by the gth 
power of P^(Fs); we get 

Hence Vrs is one of the functions (7), so that rs is one of the 
substitutions (8). 

ExampIxE. For the field R of rational numbers, a Galois resolvent ofr® + 
X- + a: 1 =0 was seen in §87 to have the two roots Vi and Fc- Hence the 
group of this cubic for P is fl, }. But for the field P (i), a Galois re- 

solvent is V ~ Th = 0 and the group is the identity. 

89. Characteristic properties A and B of the group G of a given 
equation for a given field F. 

A. If a rational function with coefficients in F of the roots of an 
equation with coefficients in F remains unaltered in value by all of 
the substitutions of the group G of the equation for P, it is equal to a 
quantity in P. 

Let be a quotient of two pol 3 momials in the roots with 
coefficients in P. The coefficients of the equation are of course 
restricted to values for which ^ 0. We have formula (5) and 

similarly 

p'(jS 

where /i(F) is a polynomial in V with coefficients in P. Since 
9 ^ 0, m(Fi) 0. If s is any substitution of then ^ 0. 
For, if fi{V) — 0 has the root F«, it has also the root Fi in com- 
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mon with the irreducible Galois resolvent G(F) = 0 (§80). 
Hence the functions 




(s = 1, a, b, p) 


are defined for each substitution s of the group (?. 

Suppose that these g functions are equal, or in other words that 
if is unaltered by all substitutions of G. Then 


f ( X(Ph) X(T a) 

The second member is a rational symmetric function with coeflB.- 
cients in F of the roots (7) of G{V) = 0 and hence is equal to a 
rational function of its coefficients, which belong to F. Hence 
(t>/if is equal to a quantity in F, 

B. Conversely, if a rational function of the roots with coefficients 
in F is equal to a quantity in F, it remains unaltered in value by all 
of the substitutions of G. 

Let <j>/i/ = r, where r is in F. Then X(F)/m(F) — r vanishes 
for F = Fi. Hence the equation X(F) ~r/4(F) = 0 with 
coefficients in F is satisfied by every root F^ of the irreducible 
equation G(F) = 0 (§80). Hence 


X(F.) ^ 
m(F.) if. 


(5 = 1, a, 6, , p), 


so that (p/if is unaltered by all the substitutions of G. 

Since an yil-valued function Fi with coefficients in a given field 
F can be chosen in an infinitude of ways, there are infinitely many 
Galois resolvents G(F) =0. Our definition of the group G of 
the given equation for the field F was based upon a single such 
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resolvent, which is determined by the special Vi chosen. But 
different functions Vi lead to the same group. In fact, G is uniquely 
determined by the given equation and field F. This is a consequence 
of the following 

Theorem 4. The group of a given equation for a given field F is 
uniquely defined by properties A and B. 

First, suppose that H = {1, r, . . . , m} is a group for which 
property A holds. Then the coefl&cients of 

^(V) ^ (7~ V^){V^Vr) ••• (F-- F,.), 

being symmetric functions of Fi, Fr, . . . , are unaltered by 
the substitutions of H and hence are equal to quantities in F. 
Since the equation <i>{V) = 0, with coefficients in F, admits one 
root Fi of the irreducible Galois resolvent G(V) = 0, it admits 
all of the roots (7) of the latter ( §80). Hence 1, a, . . . , p occur 
among the substitutions of jH, so that <? is a subgroup of ff , 

Second, suppose that X = {1, p, . . . , is a group for which 
property B holds. Then the Galois function G(Fi), being equal 
to the number zero of F, remains unaltered in value by the sub- 
stitutions of K, so that 

0 = G(Fi) = (?(Fp) = ... = (?(F,). 

Hence F^, . . . , F^ occur among the roots (7) of (?(F) = 0. 
Thus is a subgroup of G. 

If iJ = X is a group for which both properties A and B hold, 
the two results show that this group coincides with G. 

In view of its repeated application below, we state the result 
obtained in the second case as the 

Theorem 5. If every rational function of the roots with coeffir 
dents in F which is equal to a quantity in F is unaltered in value by 
every substitution of a group K, then K is a subgroup of the group G 
for F of the equation. 
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90. Transitive and regular groups. A group of substitutions 
on j Xn is called trandtive if it contains substitutions Si 

which replace xihj x^ for ^ = 1^ . . , ,n. Then it contains a sub- 
stitutions,"^ Sj which replaces X, by X;. A group which is not transi- 
tive is called intmndtive, A transitive group of order n on n 
letters is called regular. 

The symmetric group on n letters and the group 

(9) (?4= {J, (12)(34), (13)(24), (14)(23)} 

on four letters are both transitive. The latter is regular, while the 
former is not if n > 2. But {/, (12), (34), (12) (34)} is an 
intransitive group. 

Theorem 6. The order of any trandtive group G on n letters is 
dividble by n. 

All those substitutions of G which leave xi unaltered form a 
subgroup E, Since G is transitive, it contains a substitution Si 
which replaces xi by x,. Hence 

G = H Hs2 4“ Hs^ -p •••-(“ Hsnj 

where every substitution of the set Hsi replaces xi by x, and 
hence is distinct from each substitution of any set Hsf having 
j 9 ^ i. Next, every substitution ^ of 6^ is in one of the sets Hsi, 
For, if t replaces Xi by x*, then tSk~^ leaves xi unaltered and hence 
is a substitution h of whence t = hsk- Hence the order of G 
is the product of the order by H by n. 

Theorem 7. If an equation is irreducible in a field F, its group 
for F is trandtive^ and conversely. 

Let /(x) = 0 be irreducible in F, Contrary to the theorem, 
suppose that its group G for F is intransitive and contains sub- 
stitutions replacing Xi by xi, X 2 , . . . , Xm, but none replacing Xi by 
one of Xm+i, . . . , Xn. Consider any substitution s oi G and let it 
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replace Xt {i g ??i) by x,-. Since G contains a substitution t wbicK 
replaces Xi by x„ it contains ts which replaces Xihy x,. Hence 
j ^ m, so that s permutes xi, ... ,Xm. amongst themselves, and 
leaves unaltered any symmetric function of them. Hence property 
A of §89 shows that the coefficients of the polynomial in x given 
by the expansion of 

g(x) = (x~ xi)(x - xs) ■■■ (x ~ a:„) 

are quantities of F. Thus fix) has the factor gix) in F, contrary 
to its irreducibOity in F. 

To prove the converse of the theorem, let G be transitive and 
suppose that/(x) is reducible in F. Then/(x) has a factor g(x) 
of degree m <n with coefficients in F, such that g(x) = 0 has 
the root Xi. Since g(xi) is equal to the number zero of F, property 
B shows that it is unaltered in value by every substitution of G. 
Since G is transitive, it contains a substitution replacing Xi by any 
chosen Xi (i g n). Hence gix,) = 0, in contradiction with 

m <71. 

Example. Find the group (? of a? - Ts + 7 = 0 for the field R of rational 
numbers. 

Since the equation is irreducible in R (Ex. 3, §81), G is transitive. By 
§67, tbe square of 

(10) ^ {xi— Xi) (Xi ~ ais) (xz ~ xi) 

is the discriminant 49 of our cubic equation, so that ^ is equal to a number 
± 7 of E. Since any transposition replaces ^ by - ^ and hence alters ^ in 
value, it is not in G by property B. The two results show that G I J, (123), 
(132)}. 

Exercises 

1. For any field F containing the coefficients of an equation having a ra- 
tional root xi and no multiple root, each substitution of its group leaves xt 
unaltered (by property B). If a root x,* is not in F, the group contains a 
substitution which alters Xj. Hence prove that the group of (:r — 1 ) (x -}- 1 ) 
(x — 2) =0 for the field R of rational numbers is the identity. But the 
group of X® — 1 =0 for E is of order 2. 
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2. The group of — 9 a* -f- 9 = 0 for jK is of order 3. Hint: The value of 
(10) is here ± 27. 

3. The group of — 2 == 0 for E f6j), where c«j is an imaginary cube root of 
unity, is of order 3. Use (10). 

4. The group of — 2 = 0 for R is the S 3 mimetric group. For, if it were 
the alternating group, (10) would have a rational value. 

5. Find the group of -+* 1 =0 for R, 

Hints: By Ex 1, §71, yi — xiZo + xz Xi, y^, and yz are the distinct roots 
0, ± 2 of 1/3 — 42/ = 0. Property B shows that each substitution s of G leaves 
2 / 1 , 1 / 2 , and yz formally unaltered. By Ex. 4, §73, s belongs to the group (9). 
Since + 1 is irreducible in R by Ex 4, §80, G is transitive and hence of 
order ^ 4 Thus G = Ga. 

6 The group of x^ -|- 1 =0 for R (i) is of order 2 

Hint: The factors x- ± i show that the group is an intransitive subgroup 
of (9). 

7 If Go X” + x”“i • • • 4“ ofn = 0 has rational coefficients and is irre- 
ducible m E, then 

(a) If there is a complex root of absolute value unity, the equation is a 
reciprocal equation of even degree. 

(b) If there is a root r -f- si, where r is rational, then n is even and the n 
roots may be paired so that the sum of the two of any pair is 2r, whence 
r = — Cl/ (n ao). In particular, if r = 0, the equation involves only even 
powers of x. 

(c) If there is an ima^ary root a hi such that a® -f 6^ js rational, then 
n is even and the n roots may be paired so that the product of any two of a 
pair is a® 4- If p is a square root of a- 4- b-, then p”= On/ao. Replacing 
a: by pt/, we obtain a reciprocal equation in y. 

8. The group ioT R of x + 1 = 0 is a cyclic group of order 

4. It is irreducible in E by Ex. 2, §83. 

9. Let G be the group for F of an equation irreducible in F whose n roots are 
rational functions /» (r) of one root r with coefficients in F for t = 0, 1 , . . . , 
n — 1. Prove that G is a regular group and that f%[fj (r)] is a root. 

91. Rational functions belonging to a group. Consider a rar 
tional fimction i/ixi, , , . ,Xn), with coefficients in a field F, of 
the roots of an equation with coefficients in F. Let G be the 
group for F of the equation. Let 1, a, 5, . . . , fc be all those sub- 
stitutions of G which leave xl/ unaltered in value. Since ^ 
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lias the value zero, we have 4'b — ^ab = 0 by property B. Hence 
so that ab is in the set 1, a, , . , which therefore 

forms a group H. We shall say that ^ belongs to the subgroup 
Hof G. 

Example. Let xi and X 2 == — xi be two roots of + 1 = 0. TLe only 
substitutions of its group (9 ) for R wMch leave Xi^ unaltered are the identity 
and (12) (34). Hence they form the subgroup to which belongs. 

Conversely, let H be any given subgroup of G. Let Fi be any 
?i!-valued function of the roots with coefficients in F, and let 
Vi, Vat . . . , Fib be the functions obtained from Vi by applying 
the substitutions of H. We may choose an integer r such that 

(r-~ Fi)(r~ F,) . . . (r ~ F,) 

belongs to H. For, if s is any substitution of H, then 3, a$, . . . , ks 
are distinct and are in H, and hence form a permutation of 1, 
a, . . . , fc. Thus \l/ is equal to 


(r ^ FJ • • * (r ^ F.J. 


But if s is a substitution of G which is not in H, then is not 
identical with ^ for all values of r, since F* is different from Th, 
Fa, - . . , F^. As in §86, we may choose an integer r such that 
every such \f/s is distinct from Then belongs to H. This 
proves 

Theohem 8. Every rational function \p with coefficients in F of 
the roots of an equation having the group G for the field F belongs to a 
definite subgroup of G, There exist such functions ^ belonging to any 
assigned subgroup of G. 

We next prove the important supplementary 

Theohem 9. If a rational function ^ with coefficients in F of the 
roots of an equation having the group G for F belongs to a subgroup 
H of index u under G, then the substitutions of G replace ik by exactly 
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u distinct functions, called the conjugates to ip under G. They are 
the roots of an equation with coefficients in F which is irreducible 
in F. 

As in §76j let 

(11) G == H + Hg, + Hg,+ - • - + Hgu. 

Let h be any substitution of H. Then 

i'ha,, = {'Ph)g,^ = ^0, ^ 

SO that Ip takes at most u values under G. If ipi — ipj for j < i, 
then xpp == }p, where p — g%gr^ is> Sb substitution of G leaving \p 
unaltered and hence is in ff. Then g^ = contrary to (11). 
Thus Pi, • ^ • ,\pu are distinct. 

Any substitution s of G merely permutes — , pu amongst 
themselves. For, g^ s is in (? and hence by (11) may be expressed 
in the form hg^, where A is in fl"; then 

{Pi)B = pg^B = pkgj^ = paj^ = ph- 

Hence s leaves unaltered their elementary symmetric functions 
which are, apart from sign, the coeflSicients of the polynomial 

(12) g(y) ^ (y - Pi) (y - P 2 ) {y - Pu)- 

Therefore these coefl&cients are quantities in F by property A. 

If g{y) is reducible in F, it has a factor t{y) with coefl&cients 
in F which is zero for y = pi. By property B, t{p 2 ) = 0, , 

t{pu) = 0. Thus t ^ g, so that g{y) is irreducible. 

Example 1. The group 6^ of + ai® -h a; + 1 = 0 for R is {I, (23)} if 
— — 1. The conjugates to ^ = ^2 — o^i under G are — tp and ^2 = 
^ xi; they are the roots of — 2y -f 2 =0, which is irreducible in R. 
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Exasiple 2. The group O of x* + 1 - 0 for S(i) is f/, sj, where s is a 
product of two transpositions. See Ex. 6, §90. Write xi = p, xa = ipjXz = —p, 
2:4 = — ipj where p ~ (1 whence p^ == i. Then s == (13) (24). 

The conjugates Xi and xs to xi under G are the roots of 1/® — i = 0, which is 
irreducible in E (i) since this field does not contain p. 


Theorem^ 10. Let 4> he a rational function with coefficients in 
F of the roots of an equation whose group for F is G. If (j> remains 
unaltered hy all those substitutions of G which leave unaltered another 
rational function i/ of the roots with coefficients in F, then 0 is equal 
to a rational function of \p with coefficients in F, 

Let H be the subgroup of G of index u to which f belongs. By 
means of (11), we obtain the u distinct conjugates 1 ^ 1 , . . . , to 
4/ — under G. Since every substitution h of H leaves 4> un- 
altered, each product hgt replaces 4> by (!>% ^ By the proof 
of Theorem 9, any substitution s of G replaces 4^iby & certain 
4/k, and likewise by Thus, if is defined by (12), the 
fractions in 


x(2/) = giy) ( 

\y — ^1 


<h 

y - f 2 


+ ••• + 


4>u \ 

y - 'Pu' 


are merely permuted by s. Tbe second member is equal to a 
polynomial X (y) each of whose coefficients is therefore unaltered 
by every substitution s of and hence is equal to a quantity in F. 
Taking as y, we get (as in the proof of Theorem 2) 

= \iP)/g'iP). 

Example. Let ^ be an nl-valued function Fj, so that the identity is the 
only substitution leaving 4^ unaltered. Since it leaves every 4> unaltered. 
Theorem 10 states that every rational function <p with coefficients in F is 
expressible as a rational function of Vi with coefficients in F, This follows 
from the like property in §87 of the polynomial numerator and denominator 
of <!>. 

^The case in which the roots are independent variables is due to Lagrange- 
Nouv. M6m. Acad. Berlin, annto 1770-1, §§ 100-4; (Euvres, HI, 374-88. 
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92. Effect on the group by an adjunction to the field. Let G be 
the group of /(x) = 0 for a field F containing the coefficients. The 
field F' = F{ip) composed of the rational functions of with 
coefficients in F is said to be derived from F by adjoining ^ to F 
( §77). If the irreducible Galois resolvent G{V) = 0 for the field 
F remains irreducible in F', the group for F' of f{x) =0 is evi- 
dently G, But if it reduces in F', let (?'(F) be that factor of 
G(F) which has its coefficients in F', is irreducible in F\ and 
vanishes for F = Fi. Then, if Fi, Fa, — ,Vk are the roots of 
G'{V) = 0, the group for F' of /(a;) = 0 is by definition G' = 
{ly a,, .. jk}, which is a subgroup of G. The former result may 
be combined with this since a group is included among its sub- 
groups. Hence we have 

Theobbm 11. By an adjunction to the field, the group of an 
equation is reduced to a subgroup. 

Example 1. For the field R of rational numbers the group 
a; + 1 = 0 is <r = {/, (23)} if xi denotes the root — 1; but in the enlarged 
field R (t), the group is the identity (Example in §88). Note that the adjoined 
quantity i = Xa belongs to the identity subgroup of G. 

Example 2. The group for R of a;* 4- 1 ~ 0 is G4, given by (9). In the 
notations of Ex. 2, §91, the group for R{i) is ft = {/, (13) (24) }. The latter 
is the sul^oup of ft to which i = xi* = zs* = — rs® = — belongs. 

These examples illustrate also the important 

Theoeem 12. Let G he the group for F of an equation with coeffi- 
cients in F. Let H be the subgroup to which belongs a rational func- 
tion of the roots with coefficients in F. By the adjunction of ^ to F, 
the group of the equation is reduced from G to H. 

It is to be proved that H has the characteristic properties A 
and B of the group of the equation for the field F^ = F(i^). 
Evidently any rational function <f> of the roots with coefficients in 
F' is equal to a rational function of the roots with coefficients 
in F. 
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First, let (j> be unaltered by all of the substitutions of H, By 
Theorem 10, <pi (which is unaltered by H) is a rational function 
of 4/ with coefficients in F. Hence — <^> is in F' == F(^), so that 
property A holds for H and F'. 

Second, let 4> be equal to a quantity r in F', namely a rational 
function r(^) of ^ with coefficients in F. Then — r(^) is a 
rational function of the roots with coefficients in F having the value 
zero, and hence is unaltered by every substitution of G and, in 
particular, by the substitutions of H, The latter leave r(^) 
unaltered and therefore also <i>i = 4>. This proves property B for 
H and F'. 


Exercises 

1 . By the adjunction of 2 the group ft of 4- 1 = 0 for F {i) is reduced 
to the identity group. 

2. By the adjunction of an imaginary cube root (a of unity, the group 
for F of — 2 =0 is reduced to the cychc group Cs (Exs. 3, 4, §90). Verify 
that 03 = x^lzi belongs to the group C 3 . By the further adjunction of 2 ^, the 
group is reduced to the identity. 

3. Find the group oi -h x + 1 = 0 for F(5D- 

93. Group of the general equation. Let the coefficients cx, . . . , 
Cn of (1 ) be independent complex variables. Let xi, . . , be the 
roots in the sense of §79. 

Theorem 13. The group of the general equation for the field F 
determined by its coefiicients and any constants finite in number is 
the symmetric group. 

Suppose the group is i? = {1, a, , . , , fc}, which does not con- 
tain the substitution s. Apart from sign, the elementary symmetric 
functions of Fi, Fa, . . . , are coefficients of the Galois resolvent 
and hence are equal to quantities in F. Hence if r is in F, 


Fx)(r- Fa) (r - F,) 
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is equal to a quantity in F. As in §91, we may choose an integral 
value of r such that ’ips is distinct from But is in F and hence 
is expressible rationally in terms of the S3Tiimetric functions 
Cl, ... j Cn of the x’s and is therefore unaltered by s. This con- 
tradiction proves the theorem. 

94. Further results. By an elaborate analysis, Hilbert^ proved 
that if / is a polynomial in xi, Xr, h, ... ,ta with integral 
coefficients which is irreducible in i?, it is alwaj^s possible in an 
infinity of wa^^s to assign integral values to the G such that / 
becomes a function of the Xr irreducible in R. Let <t> (a:) = 0 be an 
equation in x whose coefficients are poljmomials in . . . , with 
integral coefficients. We can assign integral values to the G such 
that the resulting equation in x has the same group for R as 
4>{x) = 0. Applying this to the general equation, we conclude the 
existence of an infinitude of equations of degree n with integral 
coefficients whose group for R is the symmetric group. Similarly 
for the alternating group. The last two results have been proved 
more simply by various writers.^ 

A beginning^ has been made on the problem to find all equations 
of degree n with a prescribed group, with explicit results for = 3 
and n = 4. 

An equation whose coefficients are rational functions of a com- 
plex variable k has a group wffiich contains as an invariant sub- 
group the monodromie group composed of the substitutions on the 
roots which arise when k describes a closed path in the complex 

^ Jour, fiir Math., 110, 1892, 104-29. In Ofversigt Finska Vetenskaps-Soc. For- 
handJingar, 59, 1916-7, No. 12, Waisala proved that integral values may be 
assigned to the ij such that / decomposes into exactly as many irreducible factors 
as when the tj are variables. 

^ Weber, Math. Papers Chicago Congress, 1896, 401-7; Algebra, ed. 2, I, 1898, 
653-5. MalHet, Jour, de Math., (5), 5, 1899, 205-16. Bauer, Jour, fiir Math., 132, 
1907, 33-35; Math. Amnalen, 64, 1907, 325-7. Schur, Jahresber. Deutsch. Math. 
Vereinigung, 29, 1920, 145-50. Furtwan^er, Math. Annalen, 85, 1922, 34-40. 

®E. Noether, Math. Annalen, 78, 1918, 221-0. Seideimann, Hid.., 230-3. Breuer, 
ibid.t 86, 1922, 108-13 (cyclic sexties). 
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plane.^ Similarly there is a monodromie group of a linear differ- 
ential equation obtained by using linearly independent integrals 
(instead of roots as before). 

The group of a system of algebraic equations has been con- 
sidered.® 

The groups of a reciprocal quartic® and De Moivre’s’' solvable 
quintic® have been found. 

The group of the equation for the nine abscissas of the points of 
inflexion of a plane cubic curv’e has been treated fully by the 
author.® A more elaborate discussion is required for the 27 straight 
lines on a cubic surface or the 28 bitangents to a plane quartic 
curve.“ 

The Galois theory has been extended to ordinary linear differ- 
ential equations^^ and to complete systems of linear partial differ- 
ential equations.^® 


‘^Miller, BlicHeldt, and Dickson, Finite Groups, 378-81. Hermite, Comptes 
Rendus Paris, 32, 1851 (CEuvres, I, 276-SO). Frobenius, Jour, fiir Math., 74, 1S72, 
254-72. Kneser, Irreducibiiitat und Monodromiegruppe alg. Gi , Thesis, Berlin, 
1885; Math, ^innalen, 28, 1886, 125-32. Hurwitz,«6id.,39, 1891,23. BiancM, Teoria 
dei Grappi di Sostituzioni, 1897, 314-33. Baker, Annals of Math., 14, 1912-3, 
119-36. Ritt, Trans. Amer. Math. Soc., 24, 1922, 21-30. 

5 Weisner, Bull. Amer Math. Soc., 30, 1924, 314-6 (concept due to Konig, Math. 
Annalen, 18, 1881, 69-77). 

® Dickson and Borger, Amer. Math. Monthly, 15, 1908, 71-78, 85-87. Miller, 
BHchfeidt, and Dickson, Fimte Groups, 1916, 291. Criteria for irreducibility of a 
reciprocal equation of degree 2n, Dickson, BuE. Amer, Math. Soc., 14, 1907-8, 
426-30. 

^PhE. Trans. London, 25, 1707, 2368-70. 
s Borger, Amer. Math. Monthly, 15, 1908, 171-4. 

®MiEer, BHchfeidt, and Dickson, Finite Groups, 1916, 327-42; Annals of Math , 
16, 1914, 50-66. 

343-75. On pp. 375-7 is a brief account of the groups of further geometri- 
cal problems. 

Picard, Traits d’Analyse, III, Ch. 17. Loevpy, H. Weber Festschrift, 1912, 19 S- 

227 . 

i^Vessiot, Annales Sc. £cole Normale Sup., (3), 21, 1904, 1-85. 



Chapter X 

EQUATIONS SOLVABLE BY RADICALS 

95. Historical note. Euler, the leading mathematieian of the 
eighteenth century, believed that every algebraic equation is 
solvable by radicals. Beginning in 1799, Ruffini published several 
attempts to prove that the general equation of degree > 4 is not 
solvable by radicals, employing substitutions on the roots. His 
final and simplest proof ^ is essentially the same as that now known 
as Wantzel’s simplification- of Abel’s proof; but he gave only a 
trivial remark by way of proof of the auxiliary theorem, now known 
as Abel’s, that, if an equation is solvable by radicals, the roots can 
be given a form such that afi the radicals occurring in their ex- 
pressions are rational functions of the roots [and roots of unity, 
with rational coefficients], 

Abel’s® proof of the last theorem has two defects. As noted by 
Sir W. R. Hamilton,^ the nth root of gjp'’ is in general of order 
higher than pV". This first defect is best remedied by ignoring 
Abel’s unnecessary classification of radicals. Second, Abel’s proof 
is vitiated by the occurrence of roots of unity and is remedied by 
adjoining to the field at the outset aU Mb roots of unity, where k 
takes the values of the indices of all the radicals involved in the 
expression for a root y. 

But to give a sound proof of the impossibility of solving the 
general equation of degree > 4 along the lines of Ruffini and Abel, 
we must introduce the ideas of groups of substitutions and adjunc- 


^ Riflessioiii intomo alia soluzione delle 
1813. 


equazioni algebraiche generali, Modena, 


^Serret, Algebre, II, eds. 4 or 5, 512. 

® Jour, fiir Math.., 1, 1826, 65-84; CEuvres completes, I, 1881, 66-94. 

^Tr^. Royal Irish Acad., 18, 1839, 171-259. A very compHcated reconstruo- 

non of Abel s proof. 
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tions to fields, whieh served Galois in his far more general theory 
of the solvability of any equation. 

96. Solvability by radicals. The four rational operations are 
addition, subtraction, multiplication^ and division. 

An algebraic equation is said to be solvable by radicals if all of 
its roots can be found by rational operations and extractions of a 
root, these operations being performed a finite number of times 
upon the coeflBcients of the equation or upon quantities obtained 
from them by those operations. Every cubic or quartic equation is 
solvable by radicals in -view of Ch. VII. One of our chief aims is 
to decide whether or not every equation of the fifth or higher 
degree is solvable by radicals. 

The above definition permits the use of the operation of finding 
one of the pth roots of a quantity previously determined, but not 
the use of the operations of finding all its pth roots. The use of the 
latter operations would imply a knowledge of all the pth roots of 
unity, whereas one of our aims is to prove that, when p is a prime, 
they are aU expressible in terms of radicals whose indices are < p. 

The solution of an equation solvable by radicals is usually ac- 
complished by the solution of a series of auxiliary equations the 
roots of any one of which can be found by rational operations and 
root extractions performed upon its coefiGicients and the coeffi- 
cients and roots of the preceding equations of the series. In order 
to focus our attention upon a particular equation of the series and 
to have a flexible phraseology, it will prove convenient to employ 
the following generalization of the foregoing definition of solvabil- 
ity by radicals. 

An equation with coefficients in a field R{kij . . . , fe») shall be said 
to be solvable hy radicals relatively to that field if all of its roots 
can be found by rational operations and extractions of a root per- 
formed upon kij ... ,km or upon quantities derived from them by 
those operations. 

For example, = 2^® is evidently solvable relatively to S(p), 
where p is a particular imaginary 13th root of unity. 
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97, Resolvent equations. Let G be the group for a field F of a 
given equation /(x) = 0 with coefficients in F. A rational function 
4^ of the roots with coefficients in F belongs to a certain subgroup 
H of index u under G, By Theorem 9 of §91, ^ is a root of a re- 
solvent equation of degree it with coefficients in F. Assume that 
this equation is solvable by radicals relatively to F. By adjoining 
its root 4/ to F, we obtain a field Fx = F(4/) for which the group 
of f{x) = 0 is now H ( §92). Assume that successive such adjunc- 
tions lead to a field Fk for which the group of /(x) = 0 is the 
identity. Then the roots all belong to Fk by property A of §89. 

The assumptions just made wiU be shown to hold true when 
f{x) is + hx^ + cx d, ii b, c, d are independent variables, 
and F is the field R (oj, b, c, d), where o) == — | + i(— 3)= is an 
imaginary cube root of unity. By §93, G is the symmetric group 
G& on the roots xi, Xc. To the cyclic subgroup Cs belongs the 
alternating function 

a == (xi — X 2 ) (xo — X 3 ) (x3 — Xi), 

whose square is the discriminant A of the cubic. The resolvent 
equation d^= A is solvable by radicals relatively to F. Adjoining 
d = A ^ to F, we obtain a field Fi = F (d ) for which the group of 
our cubic equation is C 3 . We employ the linear functions <!> and 4 ^ 
of §66 and recall that, in the exercise following it, we verified 
that the substitution (132) of C 3 replaces <f> by o)<j> and 4^ by 02 ^, 
Thus all the substitutions of Cz leave (j>^ and 4 ^ unaltered- The 
latter are equal to rational functions of d with coefficients in F 
(Theorem 10 of §91). The computations in §§66, 67 give 

# = ilB + 3d(^ 3)^] 4 ^^ |[F - 3d(- 3)^] = h, 

= 52 — . 

where B == 9bc — 26® — 27d. The resolvent equation 
where g belongs to the field Fi = F(d), is evidently solvable by 
radicals relatively to Fi. In the enlarged field F 2 = Fi(g^)^ 
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the group of the given cubic equation is the identity group to 
which 4> belongs. Hence its roots Xi all belong to F 2 ; their expres- 
sions in terms of quantities in Fo are given by formulas (5) of §66. 

98. Group of a resolvent equation. In our later discussion of 
the solvability by radicals of the resolvent equation defined at the 
beginning of §97, we shall need its group. Let belong to the 
subgroup H of index u under (?. In §91 w^e proved by means of 

(1) (? = H + Jigs + Hgs + • • • + Hg, 

that ^ is one of exactly u distinct conjugates under the group Gi 

( 2 ) 

and that any substitution s of <? replaces these by 
(3) 4'g2Sf 4^azS} • • • ? 

which are merely the functions (2) rearranged. Hence to any 
substitution s of G on the letters Xi^ . . , ,Xn there corresponds the 
substitution 



on the u letters (2). Similarly, to i of G corresponds 



where in the upper row of the final symbol we have adopted the 
order (3) of the letters (2). This is permissible since a substitu- 
tion is not changed if we rearrange the columns in its two-rowed 
notation. The product o-r therefore replaces by and 
hence is equal to a substitution of t 3 rpe (4) which is derived from 
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(4) by replacing s by st In other words, the set of all substitu- 
tions (4) is a group P. This group is transitive since s may be 
taken to be any 

Theobem 1. To each substitution s of G corresponds a unique 
substitution a defined by (4). All such substitutions a form a transi- 
tive group r. 

If to t corresponds r, we saw that to st corresponds err. In other 
words, the correspondence between substitutions of G and those 
of r is preserved under multiplication. For this reason, G and T 
are said to be isomorphic. According as the order of G is equal to 
or exceeds the order of F, (? is called simply or multiply isomorphic 
to r. 

Example. Let G be the alternating group on the independent variables 
Xif ... f X 4 . We have 

<? = + (?4(234) + (?4(243), 

(?4= {J, (12) (34), (13) (24), (14) (23)}. 

The indicated substitutions of order 3 replace Tp = (xi — X 2 ) (x^ — X 4 ) by 
1/^2 = (xi - Xa) (X4 — X2) and ^3 = {xi — X4) (xa — x&). Each substitution of 
Gi leaves ^2, and ^3 unaltered.^ Hence ^ belongs to G 4 and has only three 
conjugates under G. We have 

r = {/, (4^3 M]. 

For, to each substitution of G 4 corresponds the identity of T. To all sub- 
stitutions of the set 0^(234) therefore corresponds the same substitution 
ii'fi ^s) of r. Hence G is (4, 1) isomorphic to F. 


Exercises 

1. If is the symmetric group and \p is the alternating function, V = 
{/, {4nPt)}. 

2. If (x is the symmetric group on Xi, . . . , X4, the group F on == xi 4- 
X3 Xi, ytj yz is the s3nDQmetric group of order 6. 


^Thia was shown in Ex. 4 of §73. See (22) of §71. 
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3 If U is tlie symmetric group on zi^ z^j Xs, and if ^ is the six-valued function 
ji Zt-\- (joXzj r is simply isomorphic to G (cf. §67, Ex 5). 

4. The group F is identical with that obtained by using another rational 
function 4> belonging to H and having its coefficients in F. Hints: By §91, ^ 
is a rational function r (^ ) of i/' with coefficients in F. By property B, — r (4^) 
is unaltered by every substitution t of G, whence <f>t ~ r(xpt) Hence to any 
substitution on the u letters (2) corresponds the same substitution on the 
distinct letters 0, 4>g ^^ The converse is true since we may inter- 
change the rdles of and 

5. Give another proof that F is determined by G and H alone, and is inde- 
pendent of If s is any substitution of G, prove that the products of the u 
sets in (1 ) by s on the right are the same sets rearranged, and that the result- 
ing substitution on the u sets is the same as (4) apart from the notation of 
the letters operated on. 

99. The special importance of the group T is due to 
Theorem 2, T is the group for F of the resolvent equation 

(5) g{y) = ( 2 / ~ ^i ) (y - ^ 2 ) (y - M = 0 

with coefficients in F, Here 4^% denotes 4/ 

To prove that T has the characteristic properties A, B of the 
group of (5) for F, note that any rational function R (^i, . . . , ) 

with coefficients in F is equal to a rational function r (a?i, . . . , r„) 
with coefficients in F: 

( 6 ) R (4^1, • . . , — r(xij . . . j Xn) == 0 . 

Since this difference is equal to the number zero of F, it is unaltered 
by every substitution s of the group G on iCi, , * . , To s corre- 
sponds a definite substitution <r of the group F on . . . , 
Hence 


(7) 


. . . , — n(%, . . . , = 0. 
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First, let R be unaltered by every substitution of T, so that 
R ^ for every a in F. Comparing (6) with (7), we conclude 
that r == fa for every s in (?. Hence by property A for the group 
(j, r belongs to F. This proves property A for the group F. 

Second, let R belong to F. By (6), r belongs to F. Hence by 
property B for the group G, — r for every s in G, Then by (6) 
and (7), = R for every <r in T. This proves property B for the 

group r. 

Since T is transitive, equation (5) is irreducible in F. 

100, Invariant subgroup. Let ip belong to the subgroup 

H = {hi = If h2j ^ , hp} 

of G, Let h and s be any substitutions of H and G respectively. 
Then t = hs leaves unaltered, since replaces 4's by 
h leaves i/ unaltered, and s replaces 4^ by Conversely, if t is 
any substitution of G which leaves unaltered, so that 
then 

where h is in H, whence t == hs. Hence \p8 belongs to the 
subgroup 

{s~^ hiS — I, /i 2 s, . . . , hp s} 

of G. This subgroup is denoted by Hs and is called the trans- 
form of H by s. Its substitution hi sis called the transform of 
/i, by s (cf. §101). 

Theorem 3. Let ^ belong to the subgroup H of index u under G. 
By means of (1), we obtain a complete set of conjugates to 4/ under 
G: 
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Theij heloThg to the respective groups 

H, Hg2 , . . . , Hg^. 

The latter groups are said to form a complete set of coBjugate 
subgroups of (?, In case they are all identical, H is called an 
invariant (or self-conjugate) subgroup of G. In this important 
case, the substitution (4) is the identity if 5 is in H, since s then 
leaves each unaltered; while any substitution s of G and the 
product hSj where h is in H, both correspond to the same sub- 
stitution (4). In other words, if we consider the sets H, Hgz, 
etc., which by (1) make up all the substitutions of G, w'e see that 
all substitutions of H correspond to the identity of the group F, 
all of Hgz correspond to the same substitution of F, etc. To 
obtain F it therefore suffices to take the substitutions w’hich 
correspond to I, go, , Qu* Since these replace by u distinct 
functions, the resulting u substitutions of F are distinct. This 
proves 

Theorem 4. If H is an invariant subgroup of G of index Uj the 
group r is a transitive group of order u on u letters and hence is a 
regular group. 

If is a prime and cr is any substitution 5^ I of F, the order of 
or is u ( §76, Exs. 2, 3), so that the powers of or give all the sub- 
stitutions of F. This proves the 

Corollary. If H is an invariant subgroup of G of prime index 
Uy then F is a regular cyclic group of order u. 

This corollary is illustrated by Ex. 1 and the example preceding 
it in §98. For each we shall prove in the next section that H is 
invariant in G, 


^ In otRer words, B is invariant in G if J? is transformed into itself by every sub- 
stitution of G. 
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101, Transforms of a substitution. There is a simple rule for 
finding the transform g~^ hg of h by g without performing the 
indicated multiplications. Let A be a product rst • • * of cycles 
affecting different sets of letters. Evidently 

g-^ hg = g-^ rg • sg ■ • ■ . 

It remains to find the transform of a cycle r = (abc . . . e) by 

^ ~\A BC ■■■EF ■■■LJ 

Then 

_ /A B C---E F ■■■L\ 

^ \abc---ef •■■I/’ 

/ABC---EF--L\ 

^ ~ \B C D--- AF --l)' 

Hence g~‘^Tg = {ABC . . . E) may be obtained by replacing 
each letter of the cycle r == {abc , . , e) by the letter by which g 
replaces it, or expressed briefly by applying g within the cycle. 
Hence hg is obtained by applpng g within the cycles of A, 
For example, 


(123)"^ • (12) (34) • (123) = (23) (14). 

Since any substitution transforms a product of k transpositions 
into a product of k transpositions, and therefore an even sub- 
stitution into an even substitution, it follows that the alternating 
group on n letters is invariant in the symmetric group. The case 
ft = 2 shows that 


(?4= {/, (12)(34), (13)(24), (14) (23)} 
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is invariant under the symmetric group U 24 on four letters^ since 
Ga contains every substitution (ab)(cd), where a, bj c, d form a 
permutation of 1, 2, 3, 4. 

A substitution is called invariant in G if it is transformed into 
itself by every substitution of G. 


Exercises 

1 . (74 is an invariant subgroup of the alternating group on four letters and 
also of the groups 0%^ Rg, Eg of §73. 

2. If R is a subgroup of G of index 2, E is invariant in G. Hint: G = H + 
Eg = H + gE, whence the substitutions of the set Eg form a permutation of 
those of gE. 

3. The only invariant substitutions of Gb in §73 are the identity I and 
a = (12) (34). Hence the only invariant subgroups of order < 4 of Gg are 
the identity and {/, a}. 

102. Simple and quotient groups. A group is called simple if 
it has no invariant subgroup other than itself and the identity 
group. A composite group is one which is not simple. A group 
whose order is a prime is evidently simple. 

Let H be an invariant subgroup of index of a group G, By 
§91 there exists a function yp belonging to the subgroup £?. Then 
the group F defined in §98 is of order u (Theorem 4 of §100); it 
is called the quotient group of G by and denoted by G/ff. Its 
order is the quotient u of the order of G by that of H. 

For example, if Ge and Gz are the symmetric and alternating 
groups on three letters, the alternating function ypi belongs to Gz 
and it takes a second value ^2 = — under Ge. Then Ge/ Gz is 
the group F = {I, {ypiyp 2 )}- 

A proper subgroup of G is a subgroup distinct from G. 

In particular, let H be a maximal invariant proper subgroup of 
G, so that H is not contained in a larger invariant proper subgroup 
of G. 
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Theorem 5. The quotient group T = G/H is simple. 

For, suppose that T has an invariant subgroup A distinct from 
r and the identity group. Write A = {q:i = I, a 2 , . . . , ad}, 
where ai corresponds to aU the substitutions of a set Hat form- 
ing a component of G in the notation (1). Write 

D = Hai -f Ha2 Had, ai = I. 

Since A is a group, whence Ha^ ■ Hoj = Hau, 

so that D is 2b group. To every substitution g oi G corresponds a 
substitution y of F, Since H is invariant in G, all the substitu- 
tions of the set g~^ Haig ~ H'g~^atg correspond to y'^cay, 
which is a substitution aj of the invariant subgroup A of T. 
But the substitutions of Haj alone correspond to a^. Hence 
g'-^ Hot g = Haj, so that D is invariant in G, Since A is distinct 
from r and the identity, D is distinct from G and H, Hence H 
is contained in the larger invariant proper subgroup D of G, 
contrary to hypothesis. 

103. Series and factors of composition, solvable groups. Any 
group G contains the identity group Gi, which is invariant in G, 
Since there is only a finite number of sets of substitutions in a 
group G of finite order, G has a finite number of subgroups. 
These two remarks show that G has a maximal invariant proper 
subgroup H, Similarly, H has a maximal invariant proper sub- 
group K, Such a series of groups G, H, K, , , , M, Gi, terminating 
with Gi, is called a series of composition of G, If H is of index u 
under G, K of index v under H, , , , , and if m is the index of Gi 
under M (i.e., the order of M), the positive integers u, v, , , , ,m 
are called the factors of composition of G (relative^ to the series 
Gi), 

^ But in fact the same for all series of compositioii of a result due to Jordan, 
Trait4 des Sutetitutions, 1870, 42. It is a corollary to the theorem of Holder 
(Math. Annalen, 84, 1889, 37 ; simplified by G. L. Brown, Bull. Amer. Math. Soc , 
i, 1S94--5, 232-4) that the quotient groups G/H, H/JX, . . . , M are identical in 
some order with those of any series of composition of G, Cf . Miller, Blichfeldt, and 
Dickson, Theory and Applications of Finite Grouj^, 1910, 175-~6. 



§ 103] SERIES AXD FACTORS OF C0:MP0SITI0X 189 

A group is called solvable if its factors of composition are all 
primes^ otherwise insolvable. 


Exercises 

1. If (?G and <js are the symmetric and alternating groups on 3 letters, the 
only senes of composition of G& is Gb, 0 ^, Gi, and Gb is a solvable group. 

2- A cyclic group of prime order is a solvable group 

3. The symmetric group on 4 letters has the series of composition Ga, 
Gi2 = alternating group, <?4 Cm Ex., §98), 6*^2, Gi, there being three choices 
for (j2. Hence O24 is a solvable group. 

4. If Cu is generated by a = (xix^ “ • xn), the only proper subgroups are 

Cb ~ {I, a^, C 4 = {/, a^, a'’ j, Cs = {/, C 2 = {I, 

and Cl = {I\. Every subgroup of a commutative group is invariant. The 
only series of composition of C12 are 

C12, Ce, G, Ci; Ci2, C6, Co, G; C12, G, G, G. 

The factors of composition are 2, 2, 3; 2, 3, 2; 3, 2, 2, respectively. 

5. If G has a subgroup H of index 2, H a subgroup K of index 2, etc., to the 
identity group Ci, then G has the series of composition Kj . , , G- 

6. Find the seven series of composition of G in §73. 

7. Prove that the three series of composition in Ex. 3 are the only ones. 

104. Theorem 6. The solution of an equation having the group 
G for the field F can be reduced to the solution of a series of equations 
each having a simple regular group for the field obtained by adjoining 
to F a root of each of the earlier equations of the series. In particular ^ 
if G is a solvable group, each auxiliary equation has a regular cyclic 
group of prime order. 

Let G, H, K, , Gi be a series of composition of G, and m, 
V, , be the corresponding factors of composition. Construct 
a rational fxmction of the roots Xi with coefficients in F such that 
^ belongs to the subgroup £f of G of index u ( §91). Then ^ is a 
root of an equation of degree u with coefficients in F whose group 
r for F is simply isomorphic with the simple quotient group 
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G/H (§§99, 102). After the adjunction of the root f to F, the 
group of the given equation becomes H for the field F(i/) by 
§92. 

Construct a rational function % of the Xi with coefficients in 
F(^) such that x belongs to the subgroup K of H of index v. 
Then x is ^ root of an equation of degree v with coefficients in 
F(^) whose group for that field is simply isomorphic with the 
simple group H/K. By the adjunction of x, the group of the 
given equation becomes K for the field F(^f/J x)- 

The final such step is the adjunction of a function belonging to 
(?i. The resulting field contains each Xi. 

105. Equations with a cyclic group. The last result is sup- 
plemented by the following 

Theorem 7. An equation, whose group G for a field F containing 
an imaginary pth root p of unity is a regular cyclic group of prime 
order p, is solvable by radicals relatively to F. 

Let the roots be xo, Xi, — , Xp^i and let G be generated by the 
substitution s = {xqXi - • - Xp^i). We see that s replaces 

(8) Ti = Xq+ Xi + 3h+ h Xp-i 

by Ti and hence leaves Vi = unaltered. Since the coeffi- 
cients of Vi belong to F and it is unaltered by aU the substitutions 
s^ of G, v^ is equal to a quantity in F by property A of §89. Evi- 
dently — To is a coefficient of the given equation. The p linear 
equations (8) for i = 0, 1, . . . , p — 1 can be solved for the p 
unkn owns Xq, — , Xp^i by the method employed in §66 for the 
case p = 3. Multiply (8) by p^*^\ where 0 g j < p, and sum for 
i = 0, 1, . . . , p — 1. In the resulting relation, the coefficient of 
Xj is 1 + • • • + 1 = while, for k 7 ^ j, the coefficient of Xk is 

— 1 
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since t is not divisible by p. Hence ^ 

p-i 

pXj = ro+Y, p“‘’ 0 ’ = 0 , 1 , . . . , p — 1 ). 

i-.l 

106. Cyclotomic equations. To perfect the last theorenij it 
remains to prove that we can find by root extractions the imagi- 
nary pth roots of unity, where p is an odd prime. They are the suc- 
cessive powers of one of them, say p. For p = 5, they may be 
arranged in the order 


p, p2, p2^ = = p3, 

so that each is the square of its predecessor. 

Since the discussion does not become more complicated, we shall 
treat the generalization from p to p* and hence obtain results 
needed also in our later study of regular polygons. In §83 we 
saw that the primitive ninth roots of unity are 

n n2 — fJ — a^ 

Ft P f H J P } H P t P P ! 

each of which is the square of its predecessor, and that the e = 
_ p*-i prinaitive p*th roots of unity are those powers of one of 
them, say p, whose exponents are the positive integers less than 
p® and not divisible by p. The above facts concerning fifth and 
ninth roots of unity may be generalized as follows: It is shown in 
the theory of numbers that there exists an integer g such that, 
when 1, g, . . . , are divided by p®, the positive remainders 
< p* coincide in some order with all the positive integers < p® 
which are not divisible by p, and hence give the mentioned ex- 
ponents of p. In other words, the primitive p® roots of unity are 

Xi-pjX2 = P% Xs = p^, jXe - 

^Lagrange, Traits i^aoluUon. Equations, 18(^, Note XIII, §16. Implicitly in 
Nouv. M6m. Acad. Berlin, ann§e 1770, §71; OEuvres, III, 333. 
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Hence 

(9) Xz - X2^, ^ . ,Xe - X^e-h = ^e^- 

The final relation follows from Euler’s theorem that is of the 
form 1 + kp% where k is an integer (as shown in the theory of 
numbers). Compare the above cases p® = 5, = 9. 

We proved in §83 that the equation having these roots a;i, . . . , 
Xe is irreducible in the field R of rational numbers. Hence its group 
G for R is transitive. Let 

_ / Xi Xo Xz- — Xe\ 

\Xa Xb Xc'-Xi/ 

be any substitution of G. By property B of §89, it follows from 
(9) that 

Xb = Xa^, Xc = Xb^j . . . , Xa = Xi^, 

But Xa -^1 = Xa^, etc., by (9). Hence 


Xb ~ ^54-1? • * * j Xa 

provided the new symbol Xe+i be replaced by Xi, If two integers 
u and V differ by a multiple of e, we write u ^ v (mod e). Hence 

6^a + ljC^6 + l^a4-2, ... (mod e), 

^ ^ / iTi X2 Xz Xk -'Xe \ 

V Xa 2!Jo-}-l Xa+2 ‘ * * 1 * * * Xq^o—I ^ 

in which is to be replaced by Xj. Hence 

S = ^ = {^\X2Xz ' — Xe)- 

Since G is transitive it contains all the powers of t 

Theoeem 8. If p is an odd prime, the group for the field of ra- 
tional numbers of the cydotomic equation whose roots are the primitine 
pHh roots of unity is a regular cydic group of order e = p^ — p^-\ 

This theorem was verified for p«=5 in Ex. 8, §90. 
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107. Sufficient condition for solvability by radicals. An equa- 
tion ixaving a solvable group for the field determined by the 
coefficients is solvable by radicals. We shall prove the more 
general 

Theorem 9, An algebraic equation having a solvable group for 
any field F containing the coefficients is solvable by radicals relatively 
to F. 

Assume for the moment that the theorem holds for every equa- 
tion having for F a regular cyclic group of prime order. We shaH 
then prove that any equation /(x) = 0 whose group for F has 
prime factors of composition r, . . . is solvable by radicals 
relatively to F. As in §104, there is a series of equations n{-iy) =0, 
m{x) = 0, ... of prime degrees u, v, , the solution of which 
is equivalent to the solution of /(x) =0. As there proved, the 
group for F of n(^) = 0 is a regular cyclic group of prime order 
u, which by our assumption is solvable by radicals relatively to F. 
The coefficients of m(x) == 0 belong to the field Fi = F{\p) and 
its group for Fi is a regular cyclic group of prime order r( §104); 
by our assumption it is solvable by radicals relatively to Fi. It 
is therefore solvable by radicals relatively to F, since the same was 
just proved for n(i/) = 0. Similarly, all of our auxiliary equations 
are solvable by radicals relatively to F. Since the roots of /(x) = 0 
are expressible rationally in terms of quantities in F and roots 

X, • • - oi these auxiliary equations ( §104), we see that/(x) = 0 
is solvable by radicals relatively to F. 

It remains to justify the assumption made above that any 
equation C (x) = 0 having a regular cyclic group G of prime order 
p for a field F is solvable by radicals of indices ^ p relatively to 
F. This is evidently true when p — 2. To proceed by induction 
on p, suppose that every equation hamng a regular c^xlic group of 
order q, where g is a prime < p, for any field D is solvable by 
radicals of indices g q relatively to D. As in the preceding para- 
graph, this implies that the equation for the imaginary pth roots 
of unity is solvable by radicals of indices < p (i.e., relatively to 
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the field R of rational numbers). In fact, its group for iZ is a regu- 
lar cyclic group of order p — 1 ( §106), whose factors of com- 
position are the prime factors of p — 1, and these primes are < p. 

Adjoin to F an imaginary pth root p of unity. The group of 
C(x) = 0 for the field F(p) is either the initial cyclic group G of 
prime order p or the identity group ( §92), In the latter case, the 
roots are in F(p) and can be found from quantities in F by ration- 
al operations and extractions of roots of indices < p, since p was 
seen to be derivable from rational numbers by those operations, 
so that C{x) =0 is solvable by radicals of indices < p relatively 
to F. In the former case, C{x) = 0 is solvable ( §105) by radicals 
of indices ^ p relatively to F(p) and hence, as before, relatively 
to F, 

Hence the induction is complete and the theorem is proved, 

CoROLLART. If p is au odd primey the equation for the p — 1 
imaginary pth roots of unity is solvable by radicals of indices < p. 

Example 1. If 6, c, c? axe independent variables and a> is an imagmary cube 
root of unity, the group of -{■ + cx -h d = 0 for the field iZ(«, h, c, d) 

is the symmetric group on the three roots (§93). Since it is a solvable group 
by Ex. 1, §103, the equation is solvable by radicals (cf. §97). Note that after 
the adjunction of the square root of the discriminant, the equation has a 
cyclic group of prime order 3. 

Exaxipie 2. Consider the quartic equation whose coeJBScients are inde- 
pendent variables. For the field F determined by its coefiScients, the group is 
the symmetric group G 24 on the roots. It is a solvable group by Ex. 3, §103. 
To the invariant subgroup On, composed of the even substitutions, belongs 
the product P of the difierences of the roots. The square of P is the dis- 
criminant A of the quartic equation. The group reduces to Gn for the field 
Fi =a F(P). The function yi = xi + xz xa is unaltered by G 4 = {/, 
(12) (34), (13) (24), (14) (23)}, while the remaining eight substitutions of 
Git replace yi by b or ya. These three y^s are by §91 the roots of a cubic equa- 
tion with coefficients in Fj (the resolvent cubic in §71 ). Since G 4 is invariant in 
Gitj the group of this cubic for the field F (P) is Gn/Gi, which is a cyclic group 
of order 3. By the adjunction of yi to Fi, the group of the quartic reduces to 
O4. The function X2 is unaltered by ft = j J, (12) (34) }, while the remain- 
ing substitutions of ft replace it by xs x*. The adjunction of a root of 2^ — 
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yiz Xi Xi X 3 X 4 = 0 reduces G 4 to the invariant subgroup Finally, the 
adjunction of Xi — X 2 whose square belongs to Fi(?/i, Xi 2 * 2 ) reduces G 2 to the 
identity group. The enlarged field contains Xi, . . , X 4 , which have been 
found by the extraction of three square roots and one cube root. 


Exercises 

1. Instead of we may employ in Example 2 

fi2 = 42/1 4. 62 _ 4 c, h ^ xi -r xi — xz — Xa, 

where the notations of §71 are used. Evidently the transformation y - 
J (i — 52 -h 4c) replaces the former resolvent cubic by one in i whose roots 
are the squares of h, h ~ Xi Xz — xa, tz = Xi -f- ^4 — 22 — X 3 . From 
these three linear relations and — h — 2 xij determine the roots x^. 

2. Modify Ex 2 by adjoining the cube root co of unity to the initial field 

and employing instead of yi the function <f> = yi yz- Write 

^ = 2/1 + 2/2 -f w 7/3 Appl 5 dng to the resolvent cubic (19) in §71 the results 

in §66 with x^s changed to y’s, show that 

<^3 „ ^3 ss 3 ^ jP^ = c2 — 36 d l2e, 

<^3 + ^3 = 2c3 - %hcd - 72ce + 27^^ -f- 2762 


which are essentially the invariants J and J of §12. Hence ^ is found as a cube 
root and then 4' is known. From the values of 4 , and t/i + 2/2 + I /3 = c, 
we obtain yi, 2 / 2 , 2/3 by solving three linear equations (cf. §66). For the field 
Fi obtained by adjoining P and <f>, the group of the quartic equation is Ga- 
If also the root k of k^ = 4yi 4- — 4c is adjoined, the group reduces to 

(?2 = {I, (12 ) (34) }. Finally, if also a root k of = 4^2 + — 4c is adjoinedj 

the group reduces to the identity and the x’s may be found as in Ex. 1, Note 
that k k tz = 46c — 6® — 8d. The elegance of this solution is due to its 
employment of binomial resolvents only and to its introduction of the invari- 
ants J and J. 

108 . Jordan’s^ theorem on the mutual adjunction of roots. 

Theorem 10. Let the group Gi for a field F of an algebraic 
equation fiix) = 0 he reduced to Hi by the adjunction of all 
of the roots of a second equation f^ix) = 0, and let the group G^for F 
of the second equaiion be reduced to H% by the adjunction of all of the 

^ Traits des Substitutions, 1870, 268-9. 
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roots of the first equation. Then Hi and H 2 are invariant subgroups of 
Gi and G 2 , respectively ^ of equal indiceSj and^ the quotient groups 
Gi/Hi and G 2 /H 2 are simply isomorphic. 

By §91j there exists a rational function gi with coefficients in F of 
the roots Xi^ , . . ^Xn of the first equation such that gi belongs to 
the subgroup Hi of (?i. Since the adjunction of the roots yi, . . . , 
ym of the second equation reduces Gi to JTi, property A of Hi 
( §89 ) show^s that gi is a quantity of the enlarged field, whence 

( 10 ) gi(xi, . . . ,Xn) = hi(yu - • . , 

where hi is a rational function with coefficients in F, Let gu • * jQh 
denote the distinct functions obtained from gi by applying the 
substitutions of Gi. Then Hi is of index k under Gi ( §91). Let 
hi, ... jhi denote the distinct functions obtained from h by apply- 
ing the substitutions of The k functions are the roots of an 
equation irreducible in F; likewise for the I functions hi. Since 
these twm irreducible equations have a root (10) in common, they 
are identical by §80. Hence k = I and the g’s coincide in some 
order with the h's. 

If is a substitution of Gi w'hich replaces gi by then Si trans- 
forms the group Hi to which gi belongs into the group to which gi 
belongs ( §100), whose order is that of Hi. Since is equal to a 
certain A, it belongs to the field F' — F{yi, . . . , ym) and hence is 
unaltered by the substitutions of the group Hi of /i(a;) = 0 for 
that field F^ (property B of §89). Hence the group to which gi 
belongs contains all the substitutions of Hi and, being of the same 
order as Hi, coincides with Hj, This proves that Hi is invariant in 
Gi. The group for F of the irreducible equation satisfied by gi is 
therefore the quotient group Gi/Hi ( §102). 

’^This supplement and the proof here employed are due to Holder, Math. An- 
nalen, 34, 1S89, 47. For generalizations and related results, see Kneser, ibU.j 30, 
1887, 179-202; shorter proof of theorem on p. 195 in Jour, fur Math., 106, 1890, 51. 
Landsberg, ibid , 132, 1907, 1-20. Bucca, Rendiconti Circoio Mat. Palermo, 14, 
1900, 122-6. lioewy, Math. Zeitschrift, 15, 1922, 261-73. 
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Let K be the subgroup of G2 to which h (j/i , . . . belongs. It 
is of index h since hi is a root of an equation of degree I = k irre- 
ducible in F. By the adjunction of hi to F and hence of gi by (10)^ 
the group G2 oi f 2 {x) =0 for F is reduced to K ( § 92 ). If not 
merely gi {xi, . . . , but all of the x^ themselves be adjoined, the 
group G2 reduces to a subgroup of K. Hence is contained in K. 
This proves the preliminary result: If the group of /i(a:) =0 
reduces to a subgroup of index h on adjoining all of the roots of 
j^ix) = 0 , then the group of JFx) =0 reduces to a subgroup of index 
1*1 (/:i S fc) on adjoining all of the roots of fi{x) = 0 . 

Interchanging /i and in the preceding statement, we obtain 
the following result: If the group of J^ix) = 0 reduces to a sub- 
group of index hi on adjoining aU the roots of fi{x) = 0 , then the 
group of /i(a;) = 0 reduces to a subgroup of index ^ fci) on 
adjoining aU of the roots of f^ix) = 0 . Since the hypothesis for 
the second result is identical with the conclusion for the first 
result, we conclude that = fc, and ki>k> h, whence h = k. 
Since H2 is contained in K, and H2 is of index h under G2 and K 
is of index k under G2, it follows that H2 and K are identical. 

For the same reason that Hi is invariant in Gi, it follows that 
is invariant in G2. Since hi belongs to the subgroup K = H2 of 
G2, the group for F of the irreducible equation satisfied by hi is the 
quotient group (j2/il2. Since this equation was seen to be identical 
with that satisfied by gi and since the group of the latter equation 
for F was seen to be Gi/Hi, the groups G2/H2 and Gi/Hi differ 
only in the notations employed for the letters on which they oper- 
ate, and hence are simply isomorphic. 

109. Galois’s theorem on adjimction. 

Theorem 11 . By the adjunction of any one root of an equation 
f2{x) = 0 whose group for a field F is a regular cyclic group of 
prime order p, the group for F of the eqication fiix) = 0 either is 
not reduced at all or else is reduced to an invariant subgroup of 
index p. 
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This is a corollary to Jordan’s theorem, since if we adjoin one 
root Xi of fiix) = 0 we really adjoin all of its roots. For, the 
identity is the only substitution of the group for F of f 2 (x) =0 
which leaves Xi unaltered in value, whence each root is a rational 
function of Xi with coefficients in F (Theorem 10 of §91). 

110. Galois’s criterion for solvability by radicals. An algebraic 
equation is solvable by radicals if and only if its group for the field 
determined by the coefiicients is a solvable group. 

It is occasionally useful to have the generalization: 

Theorem 12. An equation is solvable by radicals relatively to 
any field F containing the coefiicients if and only if its group for F is 
a solvable group. 

That the equation is solvable when the group is a solvable group 
was proved in §107. We shall now prove the converse. Assume 
therefore that the roots Xi, ... ^Xn can be derived by rational 
operations and root extractions from quantities in F or from quan- 
tities obtained from them by those operations. The index of each 
root extraction may be assumed to be a prime since a pgth root is 
a pth root of a gth root. Such a two-story radical as well as the 
underneath radical are both listed in making a list of all the 
radicals 

( 11 ) 

which occur in the expressions for Xi, ... ^ Xn- Here Ui is in F, 02 
is in the field derived from F by adjoining the first radical, a^ 
is in the field derived by adjoining the first two radicals, etc. Also 
pij ... yPh are primes. 

By the corollary in §107, the equation for the imaginary pth 
roots of unity (where p is a prime) is solvable by radicals of indices 
< p. Those roots may therefore be expressed rationally in terms 
of radicals forming a chain of type (11 ), where now F is the field of 
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rational numbers. Write down the radicals of the chain for p == 3 
and follow them by the radicals of the chain for p = 5 , and so on 
for the primes in order up to and including the maximum of 
pi, — , p*. After the last of these radicals write the radicals (11 ) 
in order. We obtain a chain of radicals 

( 12 ) 

having the following properties: Firsts gi, . . . , are primes. 
Second, bi is in the field F, while bo, 63 , . . . are in the fields derived 
from F by adjoining the first radical, the first two radicals, . . . , 
respectively. Third, Xi, . , . ,Xn are rational functions with coeflBi- 
cients in F of these radicals, since they were such functions of the 
included radicals (11). Fourth, after the adjunction to F of the 
first r — 1 radicals ( 12 ), we obtain a field containing ah the 
imaginary grth roots of unity. 

Lemma. For afield containing A and an imaginary pth root p of 
unity, where p is an odd prime, the group of = A is the identity 
group if one root belongs to the field, but is a regular cyclic group of 
order p if no root belongs to the field. 

For, if one root is r, the remaining roots are pr, pV, . . . , r. 
All of the latter belong to the field containing p if r does, and the 
group is then the identity. Next, let no root belong to the field, 
so that A is not the pth power of a quantity of the field- Then 
— A is irreducible ( §82), so that the group is transitive and 
hence of order ^ p. The notation for the roots x% may be chosen 
so that 

X^ = pXij X 3 = pX2, . . . , Xp = pXp—i, Xi = pXp, 


Consider any substitution 


/Xi X2 Xz — ' Xp\ 
\Xa Xb Xe — ’Xi/ 
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of tlie group. By property B of §89, we have 

3/5 ” p3/a “ ^a-i-1? “ P^b “ 375+lj • • • 

As in §106, s is the power a — 1 of {X 1 X 2 X 3 ’ - ^ ^p)f which there- 
fore generates the group. 

Let G be the group for F of the given equation having the roots 
Xi, . . . , Xn- By the Lemma and Galoises theorem ( §109), the 
adjunction of the first radical (12) to F either does not affect G 
or reduces it to an invariant subgroup of index ^ 1 . The subse- 
quent adjunction of the second radical (12) either does not affect 
the resulting group or reduces it to an invariant subgroup of index 
^ 2 - After the adjunction seriatim of all the radicals (12) we have 
a field containing rri, . . . , Xn, for which the group is now the 
identity Gi. Since we passed from G to Gi through a series of 
groups each an invariant subgroup of its predecessor of prime 
index, we have a series of composition of G such that each factor 
of composition is a prime. Hence (? is a solvable group. 

111. General equation of degree n>4 not solvable by radicals. 
By §93, the group of the general equation of degree n (i.e., one 
whose coefficients are independent complex variables) is the 
S 5 anmetrie group G for the field determined by its coefficients and 
any constants finite in number. By §101, G has as an invariant 
subgroup the alternating group, which is composed of all the even 
substitutions on n letters. 

Theoeem 13. If 71 > 4, the aUemaiiag group A on n letters is 
simple. 

For, suppose A has an invariant subgroup H distinct from the 
identity group Gi, Of the substitutions of H different from the 
identity I, consider those which affect the least number of letters. 
Any such substitution h involves the same number of letters in its 
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various cycles. For, if two cycles involve a and 6 letters, where 
a <hy Ti^ is not 1 and affects fewer letters than h does, since it 
leaves the a letters unaltered. 

Having ft, the invariant subgroup H oi A contains the trans- 
form of ft by any even substitution e and hence contains p == 

he - Tt'K 

(i) If ft contains more than three letters in any cycle, say 

ft = (1234 the choice e = (234) gives p = ( 124 ), 

which affects fewer letters than ft does. 

(ii) If ft contains at least two cycles of three letters, say ft == 
(123) (456) . . . , the choice e = (125) gives p = (15243), which 
affects fewer letters than ft does. 

(iii) If ft contains at least two cycles of two letters, say ft == 
(12) (34) . . . , the choice e = (123) gives p == (13) (24), which 
affects fewer letters than ft does unless ft = (12) (34). For the 
latter ft, the choice c = (125 ) gives p= (125), which affects fewer 
letters than ft does. 

Hence the substitutions 9 ^ I oiH which affect the least number 
of letters are all of type {ahc), since (oft) is odd and hence is not 
inH. 

Next, any cycle of three lettem can be transformed into any 
other such cycle by an even substitution, so that H contains every 
cycle of three letters. For, (123) is transformed into (a5c) by 
both of the substitutions 


/I 2 3 4 5- 

- • n\ 

, , , / 1 2 3 4 5 • 

■ ■n\ 

\a h c d e • • 

.z)> 

s(de) = ( , j 

\a 0 c e a ■ 

■■1 ) 


where the dots denote the same letters in the two substitutions, 
one of which is even and the other odd. 

Finally, every even substitution e is expresrible as a product of 
cycles of three letters. For, e — tih - • • fejt, where each U is a 
traiisposition. If h and h are identical, they cancel. If they have 
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just one letter a in Gommon, = (db) (ac) = (abc). If they 
have no letter in common, 

tits - (ab)(cd) = (abc) (cad). 

Similarly for kh, , tzh^i hk* Hence H — A. 

If n > 4, the symmetric group G onn letters therefore has the 
series of composition G, A, Gi, with the factors of composition 2 
and I • 9i!. Since the latter is not a prime, G is not a solvable 
group. This proves 

Theoeem 14. 1/ n > 4, the general equation of degree n is not 
solvable by radicals. Moreover j its roots cannot be found by rational 
operations and root extractions performed upon the coefficients and 
any constants j finite in number y or upon quantities derived from them 
by those operations. 


Exercises 

1. If n > 4, there is no series of composition of the symmetric group G on 
n letters other than G, A, Gi, For, if G has an invariant subgroup H other 
than Gi, the proof of Theorem 13 shows that either H — A ot else H contains 
a transposition (ab). In the latter case H contains every transposition and 
coincides with G. 

2. If the general quintic equation is not solvable by radicals, the same is 

true of the general equation of degree n > 5. Hint: Equate to zero the coeffi- 
cients of xj for j < 5. 

3. Hence give another proof of Theorem 14 by amplifying the following 
new proof of the simplicity of the alternating group G on 6 letters. Let s 
replace 1, 2, 3, 4, 5 by any assigned permutation a, 6, c, d, e of them. If an 
invariant subgroup H oi G contains (123), it contains its transform (abc) 
by an even s. With (12) (34), H contams its transform (ab)(cd) by s or 
(12 )s, one of which is even. With t = (12345), E contains its transform 
k = (ahcde) by s if s is even. But if s is odd, I — (3245) transforms ^ =* 
(13524 ) into so that fe is even and transforms ^ into k. Hence H is of order 
A = 1 -|- 20a; -f 15y + where x, y, z take only the values 0 or 1. Also, 
A is a divisor < 60 of 60. If A has the factor 5, then s = 1 and A 25, which 
is not a divisor of 60. Hence A is a divisor of i 60 and hence A = 1. 
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4. If for a field F the group of an equation with the roots xij . . . , Xn, is 
generated by s = (xi • • • Xn), the equation is irreducible and xz = r(xi)j 
where r is a rational function with coefficients in F, and also Xz = rCxs), . . . , 
Xn — r{xn^i)j xi — r(xn). Conversely, an irreducible equation in F whose 
roots satisfy these n relations has a regular cyclic group for F. 

5. By supplementing Ex 3, show that every substitution of the alternating 
grouo on 5 letters is conjugate to I, (123), (12) (34), t = (12345 ), or 

112. Solvable quintics. The Jacobi-Cayley resolvent sextic of 
a qnintic bas been found recently by the author^ by direct, ele- 
mentary methods. The quintic is solvable by radicals if and only 
if the sextic has a rational root. He easily deduced the covariant 
resolvents of Perrin and McClintock. 

Berwick^ computed a resolvent sextic whose coeflScients are 
invariants of the corresponding quintic form. 

^Bull. Amer. Math. Soc., 31, 1925, 515-23. For this and other resolvents, see 
Brioschi, Math. Annaien, 13, 1878, 109--60. 

^Proc. London Math. Soc., (2), 14, 1915, 301-7. Result quoted in Mathews 
Algebraic Equations, 1915, 55. 



Chapter XI 

CONSTRUCTIONS WITH RULER AND COMPASSES 

113. The problems to be solved. We shall prove that it is not 
possible, by the methods of Euclidean geometry, to trisect all angles, 
or to construct a regular polygon of 7 or 9 sides. For these problems 
the ancient Greeks sought in vain for constructions with ruler and 
compasses. We shall specify all the values of n for which a regular 
polygon of n sides can be so constructed. One such value is 17. 
The fact that a regular polygon of 17 sides can be constructed 
with ruler and compasses was not suspected during the twenty 
centuries from Euclid to Gauss. 


114. Analytic criterion for constructibility with ruler and com- 
passes. The first step in our consideration of a problem proposed 
for construction consists in formulating the problem analjrfcically. 
In the ancient Delian problem of the duplication of a cube, we 
take as the unit of length a side of the given cube, and seek the 
length a: of a side of another cube whose volume is double that of 
the given cube, whence x® = 2. 

For the problem to trisect a given angle A, we employ the 
trigonometric identity 


cos A = 4 cos^ — — 3 cos 
3 3 


Multiply each term by 2 and write x for 2 cos Thus 
(2) — 3x — 2 cos A == 0. 


Given A we can construct (with ruler and compasses) a line whose 
length is the positive value of ± cos A ; it is the adjacent leg of a 
right triangle, with hypotenuse of unit length, formed by dropping 
a perpendicular from a point in one side of A to the other side, 
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produced if necessary. If it were possible to trisect angle i.e., 
construct angle we could as before construct a line whose 
length is ± cos fA, and hence a line whose length is double that 
of the former, viz., ± x. 

In each of these problems we have been led to the question of 
the possibility of constructing a line whose length is + where x 
is a real root of a cubic equation with known coefficients. 

Ceiterion.^ If X is a real root of a given equation^ it is possible to 
construct with ruler and compasses a line of length i: x if and only if 
X can be derived by rational operations and extractions of real square 
roots performed a finite number of times upon the coefficients or upon 
numbers obtained from them by those operations. 

First, if X can be derived in the manner indicated in the criterion, 
we can construct a line of length ± x. For, a rational function of 
given numbers is obtained from them by additions, subtractions, 
multiplications, and divisions. The construction of the sum or 
difference of two segments of straight lines is obvious. To con- 
struct segments whose lengths are the product p === ab and quo- 
tient q = a/b of the lengths a and b of two given segments, use 
similar triangles two of whose pairs of corresponding sides are 
1, b and a, p, and 1, q and b, a, respectively. Finally, a segment s 
of length where n is positive, may be constructed by drawing 
a semicircle on a diameter composed of two segments of lengths 
1 and n, and drawing a perpendicular s to the diameter at the 
point which separates the two segments. 

Second, suppose that we can construct a line segment of length 
± X, when we are given segments representing the coefficients of 
the equation for x, whose leading coefficient is unity. We may 
choose any line as the first constructed line OX and choose any 
point 0 on it as the point from which we draw another line or 
about which as center we draw a circle. Take 0 as the origm and 

^ Descartes, La g$om6tiie, Leyde, 1638, livre premier (German transl. by ScMe- 
sanger, Berlin, 1894; Elnglisb transl.. Open Court Publishing Company). 
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OX as the x-axis of a system of rectangular coordinates. The fur- 
ther lines and circles constructed are determined by points which, 
with the exception of 0, are located as the intersections of straight 
lines and circles. The coordinates of the intersection of two inter- 
secting straight lines are evidently rational functions of the 
coefficients of their equations. If the straight line y = mx + b 
intersects the circle 


{x - pY + (y - qY = 

the coordinates of the points of intersection are found by eliminat- 
ing y, solving the resulting quadratic equation for x, and inserting 
the roots x into y = mx + b; hence the coordinates are obtained 
from m, fe, p, g, r by rational operations and the extraction of a 
real square root. Finally, two intersecting circles cross at the 
intersections of one of them wdth their common chord, so that this 
case reduces to the preceding. Since each such circle or line other 
than OX was determined by two points which (with the exception 
of 0) were found as intersections of earlier lines and circles, the 
coefficients of their equations are obtained by rational operations 
and extractions of real square roots performed either upon the 
coefficients of the given equation or upon numbers obtained from 
them by those operations. 

115. Trisection of an angle- To prove that it is not possible to 
trisect angle 120° with ruler and compasses, note that cos 120° = 
— I, so that (2) becomes 

(3) x3 - 3x 4- 1 = 0. 

By Ex. 3 of §81, this equation is irreducible in the field R of 
rational numbers. Since its discriminant is 81, the alternating 
function has a rational value ± 9. Hence the group for R is the 
alternating group Gz of order 3. By the adjunction of a square 
root, the group is either not changed or else is reduced to a sub- 
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group of index 2 (§109). The second alternative is excluded. 
If a root of the cubic equation could be found hy rational oper- 
ations and extractions of square roots performed upon its co- 
efficients or upon numbers obtained from them by those opera- 
tions, the adjunction of that root to R would not reduce the group 
(t 3 , whereas the adjunction of any root reduces Gz to the identity 
group. The above criterion therefore proves that it is not possible 
to construct a line whose length is the root 2 cos 40° of the cubic 
equation, i.e., to construct angle 40°. Hence it is impossible to 
trisect angle 120° with ruler and compasses.^ 


Exercises 

1. A regular polygon of 9 sides cannot be constructed with ruler and 
compasses. 

2 The duplication of a cube is impossible with ruler and compasses- 
Hints: The group for Roi = 2 is the symmetric Ge. The adjunction of one 
root reduces it to a group of index 3. 

3. It is impossible to trisect an angle whose cosine is §, i, i, I, or p/q if 
p and q (q > 1) are integers without a common factor and q is not divisible 
by a cube. 

4. It is possible to trisect an angle whose cosine is if the 

integer a is numerically less than the integer h. Hint: Take cos |A = a/h in 

( 1 ). 

5. It is impossible to construct lines representing the lengths of the edges 
of a rectangular parallelepiped having a diagonal of length 5, surface area 24, 
and volume 1, 2, 3, or 5. 

6. It is impossible to construct a straight line representing the distance x 
from the circular base of a hemisphere to the parallel plane which bisects the 
hemisphere (problem of Archimedes) Hint: The equation for a: is (3). 

116. Regular polygons. The construction of a regular 7i-gon by 
ruler and compasses is equivalent to that of angle 27r/n and hence 
of a line of length ± cos 2T/n. The irreducible equation satisfied 

^The author gave an elementary proof without group theory in Amer. Math. 
Monthly, 21, 1914, 259-62; Mathematics Teacher, 14, 1921, 217-23. 
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by the latter number is much more difficult to form and treat than 
that having the root 

(4) r == cos 27r/n + i sin 2Tr/n, 
which is an nth. root of unity. We have 

1/r = cos 2Tr/n — i sin 2^/^? ^ ^ cos 2Tr/n. 

Hence if r can be expressed in terms of ^ and real square roots, 
cos 2'ir/n can be expressed in terms of real square roots. The con- 
verse follows from (4) and the fact that the sine can be found from 
the cosine by a real square root. Hence a regular n-gon can be 
constructed with ruler and compasses if and only if the imaginary 
nth root (4) of unity can be found by extraction of square roots, 
all except the last one of which is real. 

First, let n = 'p% where p is an odd prime. Then r is a root of an 
equation of degree 6 = (p — 1) whose group for the domain 
R of rational numbers is a regular cyclic group G of order e ( §106). 
Since the identity is the only substitution of G which leaves a root 
unaltered, the adjunction of that root reduces G to the identity 
group Gi. If a regular p*-gon can be constructed with ruler and 
compasses, the adjunction of the root r is equivalent to that of 
several square roots, the adjunction of each of which causes either 
no reduction in G or a reduction to a subgroup of index 2 ( §109). 
Since the adjunction of r reduces G to Gi, its order e is a power of 
2. If a > 1, e has the odd factor p and is not a power of 2. If 
5 = 1, then e = p — 1; if also e = 2®^ where a is odd, then p = 
2® ^ + 1 has the factor 2^+1 and is not a prime. Hence if a regular 
p^-gon can he constructed, where p is an odd prime, then s = 1 and 
p is of the form 

(5) 2^ + 1. 

For = 0, 1, 2, 3, 4, the corresponding numbers (5) are 3, 5, 17, 
257, 65537, all of which are primes. But for fc = 5, 6, 7, 8, 9, 11, 
12, (5) is known to be not a prime. 
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Conversely, if p is a prime such that p - 1 = 2^, a regular 
p-goiL can be constructed with ruler and compasses. For, the 
group for R of the equation for r in (4) is a regular cyclic group 
0 of order 2^. Hence 0 has a series of composition for which each 
index is 2. 

By §104, the solution of the equation for r reduces to the solu- 
tion of a series of quadratic equations.^ 

Next, let n = abj where a and b are integers without a common 
factor >1. If a regular a-gon and a regular 6-gon can be con- 
structed with ruler and compasses, the same is true of a regular 
72.-gon. For, any multiples of the angles 2x/a and 27r/fe can then 
be constructed and hence also the difference of these multiples. 
By §78, there exist positive integers c and d such that ca -- db = 1. 
Hence we can construct the angle 


c • 



ab 


{ca — db) 


ab 


and therefore a regular a6-gon. Conversely, from the latter we 
obtain a regular a-gon by using the vertices numbered 1, 6 + 1, 
2?> + 1, . . . , (a — 1) 6 + 1. Hence if n — , where 

p^Qj... are distinct primes, a regular n-gon can be constructed if 
and only if a regular p*~gon, a regular g^-gon, . . . can aU be con- 
structed. Since a regular 2^-gon can be constructed by repeated 
bisections of 180® our results may be combined into the 

Theorem. A regular polygon of n sides can be constructed with 
ruler and compasses if and only i/ n = 2^ Pi P 2 . . . , where pi, p 2 , . . . 
are distinct primes of the form (5). 


^In. Dickson’s First Course in Uie Theory of Equations, 1922, 41-44, these 
quadratic equations are obtained when p == 17 and a construction of a regular 
17-gon is deduced. The same results are given also in his article in Monographs on 
Modem Mathematics, Longmans, Green, & Co., 1911, which gives also an exposi- 
tion of the method of Gauss for finding the series of equations whose solution leads 
to r for any prime p. 
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117. TscMmliaus transformatioiis. Let iCi, . . . , Xn be the 
roots of 

(1) fix) = + Cl X^"^ + • • • + Cn = 0. 

To find the equation having the roots yi = Xi + 2^, we have merely 
to apply the transformation y = x + ?;; we get 

fiy — v) + (ci — + • • • = 0. 

The coefficient of 2/”~^ is zero if v = Ci/n. Hence we can remove 
the second coefficient by rational operations. 

To find the equation whose roots are the squares of those of 

(1) , transpose all terms of odd degrees, square each new member, 
and then replace x^ by y. 

These transformations y — x y = x^^ bs well as = 1/x 
(which is used to find an equation whose roots are the reciprocals 
of the roots of a given equation), are all cases of a Tschirnhaus 
transformation 

(2) y=g(x)/h{x), 

where g and h are polynomials such that A (x) vanishes for no root 
of fix) = 0. 


Exercises 

1. The equation whose roots are the squares of the roots 1, 2, — 2 of 
— — 4x4’4 = 0is^^ — 9^/* + 24ty — 16 = 0. 

210 
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2. Find the equation whose roots are the cubes of those of 2 ;^ -f ca; + 5 = 0. 
Hint: 

(““ ^ Zahx (ax -}- 5). 

Replace ax hhy — and then write y for 3^. 

3. To find the equation whose roots are the fcfch powers of the roots of (1 ), 
employ the fcth roots n, . . . , r * of unity and 

i k 

P = n /(ifl-.) = n[(!/r< - Si) ••• (j/Ti - a:„)] 

i“l t-1 


^ » 

= n [(yn - ic,) • • • (yrii — x,)] = (~ !)(*+!)« n (y* — Xj^). 

Hence the expansion of the initial product P contains only powers of y whose 
exponents are multiples of h. Write « s. 

4. Transformation (2) is only apparently more general than y = Fix), 
where P is a polynomial. Hint: By §78, there exist polynomials s(x) and 
t(x) for which 1 = s(x)f(x) -f* t(x)h(x). Take 3,8 x b. root x, oif(x) = 0. 

5. Let <t>(y) - 0 be the equation having the roots yi = P(xi) for 
f == 1, . . . , n. If yi, . . . , y„ are distinct, then xi = r (yi), . . . , Xn = r(y„), 
w-here r{z) is a rational function of z whose coefficients are derived rationally 
from those of P(x) and f(x). Hence if we solve <^(y) =0 and find that its 
roots are distinct, we can find the roots of / = 0 rationahy in terms of the y,. 

Hint: The greatest common divisor dt(x) of f{x) and P(x) — yt is found 
by rational operations, and is of the first degree in x (otherwise it would vanish 
for two distinct roots Xi and a:, of / = 0, and then P{x) '-yi would vanish for 
X — Xj, whence y,- = y,). 

6. If (y) = 0 in Ex. 5 has a root y> of multiplicity m», the determination 
of the m^ roots of /(x) = 0 for which P (x) has the value y^ requires the solu- 
tion of an equation d^(x) = 0 of degree mi. 

Hint: For each of the m, roots, both f(x) and P(x) — y* vanish, and 
therefore also dx(x), which is a linear combination of them ( §78). 

118. Principal equations. We can always transform (1) into a 
principal equation of degree n in 2 ^ in which the coefficients of 
and are both zero. By §117, we may assume that Ci = 0. 
Let C 2 0. Write for the sum of the fcth powers of the roots 
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Xi, . . . , Xn- By Newton's identities, Si = 0, S2 = — 2c2. Employ 
the transformation y ^ + ax + b. Write for the value of 

y when x = Xjb. Then ^yk — S2 + nb will be zero if we take 
fe — — 82 / 11 . Next, 

2/^ = x^ + 2ax^ + (a^ + 26 )x^ + 2a6x + 6^, 

whence 

2 2 /^* = S4 + 2as3 + (a* 4- 2b)s2 + = 0 

is a quadratic equation for a in which the coefficient §2 of is not 
zero, and hence has two roots. Choosing either root, we may 
compute similarly ^ • • • and then compute by New- 

ton's identities the coefficients of the equation having the roots 
2/n • . - > Vn. 

Theorem^ 1. By a linear transformation, or by a quadratic 
Tschirnhaus transformation whose coefficients involve a single square 
root, any equation can be reduced to a principal equation. 

119 . The Bring-Jerrard normal form. We shall transform any 
principal equation into one in which the coefficients of 
yn- 2 ^ yn-z g|][ 2ero. By hypothesis, ai = §2 = 0 in (1). If also 
^3 = 0, then Cl = C2 = Cs == 0 by Newton's identities, and no 
transformation is necessary. Next, let S3 5*^ 0. The functions 

(3) g{x) = X® + ox^ — sz/n, h{x) = x^ + 6x2 — 

evidently have the property ^ ^ 0, 21 A = 0, where 22 ^ 

denotes XI 9 (a:*). The conditions for 22 ^9 = 22 ~ 0 

S4 + asz = 0, S5 4* bsz == 0, and may be satisfied by choice of 
a and 6. The function 

(4) ^(x) zg + wh 

^ TscMmliaus, Acta Eruditomm, 2, 1683, 204-7. 
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will have the property — Oil 

(^) gh + = 0. 

This homogeneous quadratic equation in z and w can evidently be 
satisfied by values of z and w not both zero. Then x and ^ have 
the properties 

(6) E ^ = 0, E = 0, 22 = 0, E = 0, E = 0. 

Write y = ux v^/. Then E ~ ^ ^ every u 

and V. The condition for E ^ is ^ cubic equation inu :v and 
hence can be satisfied by values of u and v not both zero. In the 
resulting equation in y, the coefficients of 
therefore all zero. 

We saw that any equation F{z) =■ 0 can be reduced to a prin- 
cipal equation J{x) ■= 0 by a transformation x = P(z), To 
fix) = 0 we just applied a transformation y == Q(x). Hence the 
final equation in y may be derived from F(s) = 0 by the single 
transformation y = Q[Piz)]. In other words, the product of two 
Tschirnhaus transformations is a Tschirnhaus transformation. 

This completes the proof of 

Theoeem 2. By means of a Tschirnhaus transformation whose 
coefficients involve a cube root and three square roots, any equation of 
degree n can be transformed into an equation of degree n in yin which 
the coefficients of y^"^, y^~^, y^^ are all zero. 

This theorem is usually ascribed to Jerrard.^ But for n = 5 it 
was obtained much earlier by E. S. Bring.^ We therefore caE 


^ Math.. Researches, Bristol and London, 1834, Pt. II. Report by W. R. Hamil- 
ton, British Assoc. Report, 5, 1837, 295-348. 

^Meletemata . . . transformationem aeqnatiomim algehraicarmn, Lund, 1786. 
Substance reproduced in Quar. Jour. Math,, 6, 1864, 45—47 ; Archiv. Math. Phys., 
41, 1864, 105-112; Annali di Mat., 6, 1864, 33-42. 
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(7) 2 /® + dy + e = 0 

the Bring-Jerrard normal form of quintic equations. 


Exercises 

1. Any equation of degree 3 or 4 can be solved by radicals* 

2. Every quintic equation can be reduced to tbe form dy^ e — 0 by 
a TscMmliaus transformation whose coefficients involve a square root and a 
root of a quartic equation. Hint: If ci = C 2 = 0, then S 4 + 4c4 ~ 0; hence 
use 2) 2 /* = 0. 

3 By choice oi tiny — tz, reduce (7) and the quintic in Ex 2 to normal 
forms involving a single parameter. 

4. Derive (7) geometrically. Hints: Let the transformation y = ao aix 
+ 02 4- da a:® -f ^4 replace (1 ) by y^ di 'h • • • =0. Then 

d] is a. homogeneous polynomial in do, , cl^ of degree By means of di = 0, 
express do as a hnear homogeneous function of di, . . . , a^. Interpret the 
latter as the four homogeneous coordinates of a point P in space (the lengths 
of the perpendiculars from P to the faces of a jSxed tetrahedron of reference 
being proportional to di, . . . , d 4 ). EKminate do from (i 2 == 0 and ds = 0. The 
resulting quadratic equation represents a quadric surface, which contains 
real or imaginary straight lines; usually there are two such lines through each 
point of the surface and then they are found by means of a quadratic equation 
in one unknown. The intersections of a chosen one of these lines with the cubic 
surface obtained from ds = 0 are found by solving a cubic equation in one 
unknown. 

120* The Brioschi normal form of quintic equations. In the 
preceding Ex. 3, we saw that any quintic equation can be reduced 
to a normal form involving a single parameter by means of trans- 
formations involving both square roots and a cube root. By 
means of a transformation involving only square roots, we shall 
now reduce any sufficiently general quintic equation to a remark- 
able form which also involves a single parameter and which plays 
a central r61e in the theory of quintic equations. 

We start with a principal quintic f(x) = 0, having therefore 
Si = 0, ^2 = 0. We assume that sa ^ 0, and employ the poly- 
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nomial (4) having the properties (6). We shall first prove the 
existence of constants p, g, . . * , i (p and q not both zero) such 
that 

(8) piP' + 2qx^|/ + rx^ -- a^p — hx + t ^ 0 [mod f{x)]j 

which means that the left member is the product of /(x) by a 
polynomial in x. 

First, let z = 1, ttJ = 0 in (4), whence \p is the cubic p. From 
4^^ we eliminate x® and x® by means of /(x) = 0, then eliminate x^ 
by means of X’p and finally x® by means of ;//; we get (8) with 

p = 1. 

Second, let u? 0. We may take to = 1 and write 
yp ^ x^ + Z7? + dx^ + e. 

For a certain quadratic function Q of x, we have 

C = xi/^ — / (x) — = fcx^ + Q, k — d -- z\ 

If fc = 0, this gives (8) with p = 0, 2g = 1. If ft 0, we elim- 
inate X® and higher powers of x from \p^ by means of / (x) = 0, then 
x^ by means of \pj and finally x^ by means of C, and obtain (8) 
withp = 1. 

Inserting the five roots of /(x) = 0 into (8), summing, and 
applying (6), we see that i = 0. 

If X and <p are any linear functions of x and tft, (6) imply 

(9) S X == 0? 22 = 0, X ^ 12 = 0, 

while (8) implies a similar relation 

flO) px^ + 2g<^x + — ax — ^ 0 [mod/(x)]. 
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We employ the identity 


( 11 ) 

where 


m F = {dx + e<j>y - c(ax + 
F = px^ + ^^4>x + 


( 12 ) 


m = p 52 __ 2qah + ra-, c = q- — pr, 
e =z hq — ar. 


d — bp — aq, 


We exclude the special quintics for which c = 0 and those for 
which both a and b are zero. The linear functions x cj) of 
X and \p may be chosen so that ax + b<j> is identical with ux + v\l/j 
where u and v have any prescribed values not both zero. We 
can choose them so that ux + v\l/ vanishes for no root oi fix) = G. 
Hence it is permissible to apply the transformation 


(13) 


dx + e(f> 
ax + b4> 


to f(x) = 0. Denote the resulting equation by 

(14) ^ + aiy^ + 02 + as + a^y + as = 0, 

We determine its coefldcients as follows: By (13), 

dx + e(l> + c^ax + b<j)) 
y + = —TTZ 


If m in (12) were zero, F would be divisible by pcx + i><t> and the 
quotient of (10) by ax + b4> would be divisible by /(a;), whereas 
its degree is ^ 4. Then if the numerator of the preceding fraction 
is zero for either value of c^, (11) shows that F = 0, and thus (10) 
shows that ax + is divisible by /(:r), contrary to its degree. 
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By (10) and (11), 

"^■(“X + ^4>) — (dx + e<i>y — c(ax + l>(j>y [mod/(a:)]. 
Hence, by division, 

—j—i = dx + e<j>-ci(ax + b^) [mod /(a;)]. 

y ~r 

(9)) this implies 

^2 = 0,1^22 = 0 if z = l/iy + ci). 

Hence if we replace 2/ by (1 — zc^)/z in (14), we obtain 

(1 — ZC'^y + 2 (1 — ZC^y + 02 2^(1 ~ 

+ asz^il - zc^y + a^z^il - zc^) + as^ = 0, 

in which the coefficients of z^ and ^ must vanish. Thus 

5c2 — 4ai + 302 c -* 203 0^ + 04 == 0, 

~~ lOc^ + 6ai c — 302 + 03 = 0. 

Since these hold for both values of 

5c2 + 302 c + 04 = 0, 4ai c + 203 == 0, 

10c -f" 3 o 2 = 0, 601 c “4" 03 == 0, 

whence Oi = 03 = 0 , 02 = — 1 10 c, 04 = Sc^. Write c = — 3C. 
Then (14) becomes 

(15) + lOC^ + 45(7^ ^ + Os = 0, 

This Brioschy normal form is especially adapted for solution^ 
in terms of elliptic functions. The effect of writing y ^ C^Y m 

1 Annah di Mat., 1, 1858, 256-9, 326-8. 

2 Kiepert, Jour, fiir Math., 87, 1879, 130. 
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(15) is to put C = i. However, by a transformation not iuvolvmg 
radicals, we may pass from (15) to a quintic equation having a 
single parameter ( §131, end). 

The only irrationality employed in the transformation (13) of 
the principal quintic /(a:) = 0 to the form (15) was the square root 
of the discriminant of (5). We may express the latter in terms of 
the discriminant of f{x). By definition, D is a determinant 
whose kth. row is 1. Xk^ x%j In view of (3), JD is equal to 

the determinant whose fcth row is 


1 , k(Xk')* 

Let H' denote the determinant having these elements in the kth 
column. Forming the product D'D by using the rows of D' and 
the columns of D, we see that 


5 0 0 0 0 


= 


0 0 S3 0 0 

0 S3 S4 2 

0 0 Y^gh 


= - 5s3^ 




Ugh 


Hence the discriminant of equation (5) is equal to 


Theorem^ 3. If we denote the product of the squares of the differ-- 
ences of the roots of the prindpaZ quintic f{x) =0 by 5^ A, that 
equation can he reduced to the form (15) by a Tschimhaus trans- 
formation whose coefficients involve the single irrationality AL 

Brioschi^ proved the existence of i(^ — 1) polynomials 
Xi{i == 1, . . . , i(n — 3)] such that the sum, sum of squares, and 


^ The above proof is a material simplificatioii of that by Gordan, Math. Anaalen, 
28, 1887, 152-166 (Jour, de Math., (4), 1, 1885, 455-8). He introduced an unneces- 
sary irratioiiality, the square root of the discriminant of F, since he used instead of 
(13) a transformation involving the linear factors of F. The part of the proof be- 
ginning with (11) follows Weber, Algebra, ed. 2, 1898, I, 264-7. 

^Rendiconti 1st. Lombardo, (2), 20, 1887, 364-70,* Opere Mat,, HI, 293-9. 
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sum of products of any two, summed for all the roots of the equa- 
tion of degree are all zero. This was proved in (9) above when 
n = 5. 

In his Chicago thesis to be published soon, R. Garver proves 
that every Tschimhaus transformation of a sufficiently general 
principal equation of degree n into another principal equation 
can be expressed in the form y = ux + v<i>, where x and <t> are 
polynomials of degrees n — 1 and n — 2 which satisfy (9). 
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GROUPS OF THE REGULAR SOLIDS; 

QUINTIC EQUATIONS 

121. Introduction. The first part of this chapter gives a theory 
of the invariants of each finite group of linear transformations on 
two variables. These groups are isomorphic to groups of rotations 
leaving unaltered the various regular solids. The final part is an 
application to the resolvents of the general equation of the fifth 
degree and furnishes a method of solving the latter. The subject 
plays an important r61e in the theories of elliptic modular func- 
tions and automorphic functions. 

The theory is due to Klein, whose book^ is a classic, but pre- 
sents difficulties to beginners on account of the introduction of 
ideas from many branches of mathematics. We shall give a simple 
exposition of the essentials of this interesting theory. 

122. Linear fractional transformation on z corresponding to a 
rotation. To readers acquainted with the theory of functions of a 
complex variable there is available^ a short proof of the desired 
formula (7). We shall give here a strictly elementary proof. 

We first define the stereographic projection of a sphere upon a 
plane. Employ a rectangular coordinate system in space. The 
sphere with radius unity and center at the origin 0 has the equa- 
tion 

(1) r + 7,2 + = 1. 

From the point N = (0, 0, 1) project an arbitrary point 
P = (S) V, D of the sphere to a point Q = (x, y) of the |l^plane. 

^Voriesungen uber das Ikosaeder, 1884. 

^Burkbaxdt’s Funktaontlieoretische Vorlesungea, I, 2, 1903, 49-52 (Englisli 
traiisl. by Easor, 81). 
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Let T be the fourth vertex of the rectangle having sides ON and 
OQ. Let SPF be parallel to NO. Let FG and QH be parallel to the 
7;-axis. Then OG — FG = t], PF = OH = x, QH — y. Since 
the triangles NSP and NTQ are similar, 



Fig. 1. 


1 - ^ -.1 = SP : TQ = NS : NT OF : OQ= n-V == k'-x. 


Hence 


( 2 ) 

By (1), 



y = 


V 

1 - r' 


1 + a:® + = 


{i-ty+e + ^ 

(1 - ty 


1 - 2r + 1 

(1 - iY 


whence 


2 

1-f’ 


(3) I = 2x/r, 7] = 2y/r, f = 1 — 2/r, r = 1 + x^ + y^. 

Write z = X + iy. Formulas (2) and (3) establish a one-to- 
one correspondence between the points of the unit sphere and the 
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points of the complex 5f-plane, provided we agree to identify aU 
points at infinity in the 2:-plane (so that s = oo alone corresponds 
toN). 

Let E ^ (X, ju, j/) and P = ($, >7? ?) be any points on the sphere 
having radius unity and center at the origin 0. Write 

. a , .a , a ^ a 

(4) a == X sin -3 0 = /x sin -3 c = v sm.-3 a = cos 

Jh ^ 

whence 

(5) + 62 + = 1. 

Consider the rotation about an axis OE through angle a counter- 
clockwise when viewed from E toward 0. Let it replace rjj f ) 
by (!', v') fO* We employ the formulas (28) of §46 which repre- 
sent the rotation, with ai, <22, as replaced by a, 6, c, and Wij W 2 , Wz 
replaced by 77 , and pu P 2 , Pz replaced by ??', f'. 

Writes = X + it/, == a; — iy. Then (3) become 

z + w i{w z) zw — 1 

^ ^ j ^ j 

1 + zw 1 + zw I + zw 


Hence 

1(1 + zw)(l — f') = + d2 + 2ti;(l ^ ^ (P) ez + eWy 

where e = 6d — ac + i(ad + fee) = (fe + ia) (d + ic). Replac- 
ing 1 — — cP by the equal value -h fe^, we get 

id + ^) (1 - n - [(6 + ia)z + d - ic]S, 
f (b — ia)w + d + ic. 

Similarly, 


i(l + m)(r + ^1?') - [(d + ic )0 - (6 - ia)]/. 
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Writing z' = a:' + iy', and employing (2) in accents, we get 
i + iy , ^' + iv' 

(6) z = :: > z = — j> 

1 — r 1 — f 

, {d + ic)z - (b - ia) 

o) 2 = rr. I • \ — rU ^ 

(o + ta)z + a — %c 

Theorem 1. Every rotation corresponds to a transformation (7). 


Exercises 

1. By (4) and (7), with X = /x — 0, p = 1, prove that the rotation about 
the f-axis ON through angle a counterclockwise viewed from N is repre- 
sented by 

( 8 ) z' = z. 

2. Prove ( 8 ) by use of the formulas from analytic geometry, 

^ cos a — 17 sin a, 17' = ^ sin a + 17 COS a. 

123. The tetrahedral group. Consider a cube inscribed in a 
sphere with the center 0 and radius unity. Four of its vertices 
1, 2, 3, 4, shown in Fig. 2, are the vertices of a regular tetrahedron. 
Their diametral points 1', 2', 3', 4' are the vertices of a second 
regular tetrahedron called the diametral tetrahedron to 1234. 

The mutually perpendicular lines AA% BB\ CC% each of 
which joins the mid points of a pair of opposite edges of 1234, 
are taken as the ^axis, ^r-axis, f-axis, respectively, their positive 
directions being from 0 to A, jB, C, respectively. 

The tetrahedron 1234 is unaltered by a rotation through angle 
0% 120®, or 240® about any of the lines 01, 02, 03, 04 as an axis, 
and also by a rotation through 180® about one of the axes AA', 
BB', CC'. These include the identity rotation (through angle 
0®), eight rotations of order 3, and three of order 2. They give all 
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the rotations leaving the tetrahedron 1234 unaltered and hence 
form a group called the tetrahedral group iSi 2 . For, exactly three 
such rotations leave the vertex 1 unaltered, while their products 
by any one rotation (about axis 04) which carries 1 to 2 give all 
the rotations which carry 1 to 2, and similarly when we use 3 or 
4 instead of 2, whence the total number is 3 X 4. 

Consider in particular the rotation S about the axis 01 through 
angle 120° counterclockwise when viewed from 1, and the rotation 
R about the axis CC' through angle 180°. Since R interchanges 
1 and 4, RSR and RS^ R leave 4 unaltered and hence give 
rotations about the axis 04 which carry 1 to 3 and 2 respectively. 
By the final remark in the preceding paragraph, the identity, S, 
and together with their products by RS^ R, RSR, and R 
(which carry 1 to 2, 3, and 4 respectively) give all the rotations of 

12 - 

Theorem 2. The tetrahedral group R 12 of all rotations which 
leave unaltered the regular tetrahedron 1234 is generated by the two 
rotations S and R, 

We shall next obtain by §122 the linear fractional transforma- 
tions on 0 which correspond to the rotations S and R. In the case 
of R, we have merely to employ (8) for a = t to obtain — z. 
Next, the axis 01 of rotation S makes the same acute angle with 
the positive directions of the coordinate axes and hence the 
coordinates of point 1 are \ = = j' = 3“^. Since a = 120° and 
cos 60° = I, sin 60° = | • 3^, (4) gives a:=b = c = d = ^. 
Then (7) becomes 

(9) - i+i 

z — % 

Theorem 3. The group Fn of all tetrahedral linear fractional 
transformations is generated by (9) and — z. 

To find the product AB of a transformation A defined by (7) 
by a second transformation B :z' — fiz)^ we write the latter in 
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the form js?" = /(^O and eliminate z' between the final equation 
and (7). The resulting transformation 2 " == is defined to be 


AB. 


Exercises 

1. Tbe group Rn is simply isomorphic to the alternating group on the 
letters 1, 2, 3, 4 denoting the vertices. 

2. The 12 transformations of F 12 are 


( 10 ) 


1 .2 + 1 

s = i s, ± -j ± 1 -j 

Z 2 — 1 


± i 


2 — 1 
2 + 1' 


2 + t 2 — f 

± 'j ± — ; — ;* 

2 — i z -jr ^ 


Hints: is given by the third function (10) with the upper sign, 

while L s ^-3. 22 ^ is 2 ' = I/ 2 . xhe last may be computed as a product or 
proved by noting that L leaves A fixed and hence is the rotation through 180° 
about axis AA', For X = l, fi ^ v ^ 0, a = 180°, (4) gives a — 1, 5 = c 
= d = 0, whence (7) becomes 2 ' = i/ {i 2 ). 

3. If J? (or 2 ' = — 2 ) is the rotation through 180° about axis CC' and if 
jr, (or 2 ' = 1 / 2 ) is that about AA\ their product M ^ RL (or 2 ' - — I/ 2 ) 
is the rotation through 180° about the conunon perpendicular BB' to CC* 
and AA\ Hence show that B, L, M, and the identity rotation form a com- 
mutative group of order 4. 


To each transformation (7), of determinant unity, corresponds 
the linear homogeneous transformation 

U" = (d + ic)u ~ (6 - ia)v, 

V {h + + (d — ic)Vj 

of determinant unity, as well as that obtained by changing the 
signs of a, h, c, d, which is therefore the product of (11) and 
C7 = — u, F = — in either order. 

In particular, to z' = izl{— i) and (9) correspond 

(12) 17 = w, F = — iVj 


( 13 ) 


C/ == 1(1 + i)u - 1(1 - i)v, 
V = 1(1 + i)u + 1(1 — i)v. 
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Theorem 4. The group H24 of all tetrahedral homogerwous linear 
transformations is generated by (12) and (13), each of determinant 
unity. 

124. invariants of the tetrahedral group H24. The vertices of 
the tetrahedron 1234 have the coordinates 

(14) ? = ± 3-^, ^ = ± 3-1, ? - ± 3-1, 

where an odd number of the signs are plus. By (6), the corre- 
sponding values of z are ± a and ± 6, where 

l+i ^ 1 -i 

3» - 1 ’ 3i + l‘ 

We seek the product of the factors av, hv. Since 
a2 = ^(2 + 3I) and b^ — — i(2 — 31) have the product 1 and 
sum 2 • 3lf, we get 

(15) »i> = — 2 - 31 + v^. 

Since the 12 rotations which leave the tetrahedron unaltered 
merely permute the four vertices, each of the 24 tetrahedral homo- 
geneous transformations merely permutes the above four linear 
functions, apart from constant factors, and hence leaves # un- 
altered up to a constant factor. That factor is evidently 1 for (12), 
while for (13) it is 

[i (1 + i)Y(l - 2 ‘ 31 f + 1) = 1 + 31 z) == 

Hence is an absolute invariant of the group Hu^ 

The vertices of the diametral tetrahedron 1'2'3^4' have the 
coordinates (14), where now an odd number of the signs are 
minus. The corresponding values (6) of z are ± d and ± 5. 
Hence the group has the absolute invariants and where 


( 16 ) 


^ -j- 2 • 31 iu^ 
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Any rotation wMch leaves the tetrahedron 1234 unaltered mere- 
ly permutes its six edges and hence their middle points. It there- 
fore permutes the points in which the axes of coordinates intersect 
the unit sphere; the corresponding values of 0 are evidently 0, co, 
+ 1, ± L The product of the linear functions which vanish for 
these values of z — u/v is 

(17) t = uv{u^ — 

It is an absolute invariant of the group, being unaltered by (12) 
and (13). 

A point on the sphere is called a special point with respect to 
the group i?i 2 of rotations leaving the tetrahedron 1234 unaltered 
if the point takes fewer than 12 distinct positions under these 12 
rotations. In other words, a point is a special point if and only if 
it is unaltered by at least one of those rotations other than the 
identity, and then the point lies on the axis of the rotation. 

The axes AA\ R5', CC^ of the rotations of order 2 are the 
coordinate axes. They intersect the sphere in the six special points 
for which the invariant t vanishes. 

The eight rotations of order 3 of R 22 have as axes the diameters 
through the vertices. The latter are the four special points for 
which the invariant vanishes. The remaining intersections of 
those diameters with the sphere are the vertices of the diametral 
tetrahedron and hence are the four special points for which the 
invariant ^ vanishes. 

Consider any homogeneous pol 3 rnomial in u and v which is an 
invariant of H 24 . If it vanishes for a special point, it has one of the 
factors t, #, After removing all such factors, we obtain an 
invariant quotient Q which vanishes for no special point. Since 
Q{u,v) is homogeneous in u and v, the equation Q{z^l) ~ Q has a 
root so that Q(m, ?;) vanishes for = ?7, == F, where U /V 

= 0 '. Let / and g denote any two of the absolute invariants 

each of degree 12. Then also / — is an absolute invari- 
ant, if c is any constant. The condition that it shall vanish at 
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O', V uniquely determines c, since is not a special point and 
therefore g{Uj F) 0. Hence Q has the factor / — eg. The 
quotient is either a constant or has another such factor. Proceed- 
ing similarly, we have 

Theorem 5. Every homogeneous polynomial in u and v which is 
invariant under the 24 tetrahedral homogeneous linear transformations 
is a product of factors t, — eg, where the c’a are constants 

7^ 0, while f and g denote any two of the absolute invariants f, 
of degree 12, 

Hence an invariant of degree 12 which vanishes at no one of the 
6 + 4 + 4 special points can be expressed in each of the forms 
aif^ — c^^), b{f — where a, fe, c, d are constants 7^ 0. 

Thus t^, and 4^® satisfy a linear identity. By considering the 
terms u^^ and u^'^ v, we find 

(18) 12 (-3)^2+ = 


Exercises 

1. If 2 and s' are the stereographic projections of two diametral points on 
the unit sphere, ~ 1 Hence derive (16) from (15) by replacing z 

by —1/z in ^(u/v)^0. 

2 The Hessian of is — 4S-3^ i The Jacobian of ^ and is 32*3^ it 
When # is taken as the fundamental quartic form, the invariant I is zero, while 
J = — 4/(3-3H). The syzygy (22) of §12 reduces to (18). 

125. The octahedral group. In Fig. 2, the solid composed of 
the two pyramids with the vertices C and C' and common square 
base ABA'B' is a regular octahedron. It is unaltered by the 
rotations of the group Bn which leaves the tetrahedron 1234 
unchanged, together with 12 rotations which interchange the 
latter and its diametral tetrahedron 1'2'3'4', viz., the 6 rotations 
through + 90° about the axes AA', BS', and CC% and the 6 
rotations through 180° about the 6 diameters each bisecting two 
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edges of the octahedron. Products of these rotations replace A 
by A', Bj B\ C, C'; since only 4 rotations about OA leave the 
octahedron unaltered, there are only 6 X 4 rotations leaving it 
unaltered. Hence all rotations leaving the octahedron unaltered 
form a group It is generated by R and S of §123 and the 
rotation E about the f-axis through 90® counterclockwise when 
viewed from C, By (S), E is represented by z' = iz, one of 
whose two homogeneous forms is 

(19) U - 2-Hl + i)u, V - 2~H1 - i)y^ 

Theoeem 6, All rotations leaving unaltered a regular octahedron 
form a group R^a of order 24. The octahedron group Has of linear 
homogeneous transformations is generated by (19) and the two genera- 
tors (12) and (13) of the tetrahedral subgroup 

126. The invariants of the octahedral group BU. Consider the 
octahedron whose vertices are the intersections of the three 
coordinate axes with the unit sphere. 

Since the function t given by (17) vanishes only for the six 
values of z which correspond to the vertices, t is an invariant of 
H 48 . We saw that t is unaltered by all the transformations of JJ 24 . 
Under transformation (19), ^ becomes — t Hence is an absolute 
invariant of H^. 

The Hessian of ^ is 25TF, where 

(20) W ~ + 142i^ A' ^ 

The Jacobian of t and W is the product of 

(21) X == - 33u« - 33u4 ^ 

by a constant. Since each transformation of Has is of determinant 
unity and replaces t hj ± t, the properties of the Hessian and 
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the Jacobian (§§3, 5) show that the transformation leaves W 
unaltered and replaces x by ± x* 

A point on the unit sphere is called a special point if it is un- 
altered by at least one rotation, other than the identity, of Ru 
and hence lies on the axis of that rotation. We saw that the special 
points on axes of rotations of the tetrahedral group Rn are the 
points for which toxW = vanishes. By §125, the only fur- 
ther special points are the central projections on the sphere of 
the mid points of the 12 edges of the octahedron. For these points 
X must vanish. In fact, since x is an invariant it vanishes for 
some set of 12 special points which is distinct from the set of the 
six vertices each counted twice, since x 5=^ ^ (cf. Ex. 3 below). 
The discussion at the end of §124 therefore leads to the 

Theorem 7. Every homogeneous polynomial in u and v which is 
invariant under the group of 48 octahedral linear homogeneous trans-^ 
formations is a product of factors t, TT, x? / cgr, where the c^s are 
constants, while f and g denote any two of the absolute invariants f^, 
W^, of degree 24. The latter satisfy the linear identity 

(22) 108^^ - + X^ - 0. 


Exercises 

1. Why is invariant under H 4 b^ 

2. Rsi is simply isomorphic to the S3rmmetric group on 4 letters representing 
the four diagonals 11', ... , 44' of the cube. 

3. The diameter which bisects the edge joining (0,0,1) and (1,0,0) 
meets the unit sphere at (i2*, 0, i2U> whose s is 1 4- 2i For this value of 
u and for t; == 1, X = 0. The invariant x therefore vanishes at the central 
projections on the sphere of the mid points of all 12 edges. 

4. There exists a fundamental system of invariants of any finite group G 

of linear transformations. By Hilbert's theorem ( §17) every invariant 1 of 
G is expressible in the form I ^ Ex Ii+ • • • + where the I are in- 

variants. Apply the transformations of G and add. Why does this proof hold 
also for relative invariants? 
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V 



Fig. 3. 


127. The icosahedral group. Consider a regular icosahedron I 
inscribed in a sphere having the center 0 and radius unity. It has 
20 triangular faces, 12 vertices, and 30 edges. If V is any vertex, 
I is unaltered by the rotation S about the axis OV through angle 
2^/5 counterclockwise viewed from U. Letter the triangular 
faces Aq VAi, At FA 2 , . . . , A 4 VAq having the common vertex 
V so that S carries Ao to Ai, Ai to A 2 , — , A 4 to Aq. The rotation 
through angle t about the axis joining 0 to the mid point of an 
edge AB wiU be denoted temporarily by [AB]; it leaves I un- 
altered. Write T for [Ao F]. Since S carries the mid point of 
Ao F to that of Ai F, 5“"^ TS leaves the latter point unaltered and 
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hence is [Ai V]. Similarly, TS^ - [A, V], We shaU speak 
of these five rotations [A j V] of order 2 and the five powers of S 
as the ten rotations connected with vertex F* They are trans- 
formed into the ten rotations connected with vertex by 
[A ifc F], which replaces F by A A glance at Fig. 3 shows that the 
vertex F', diametrically opposite to F, is the only vertex which 
cannot be reached by an edge starting from one of the vertices 
Ao, . . . , A 4 . If Ao R is either of the two edges having Ao as one 
end point and distinct from Ao A 4 , Ao F, Ao Ai, the rotation 
[Ao B] transforms the five rotations about the axis OAq into the 
five rotations about the axis OBy and one of the latter carries Ao 
to F'. We have now proved that suitably chosen products of S 
and T carry F to all 12 vertices. The five powers of S are the only 
rotations of I into itself which leave F unaltered. Hence there 
exist exactly 5 X 12 rotations of I into itself. 

Theorem 8. S and T generate the icosahedral group iJeo of all 
rotations of I into itself. 


Exercises 

1. Since T interchanges V and Ao, as weU as' Ai and A4, the product 
ST — (FAo A4) • • • is a rotation of order 3 whose axis joins 0 to the mid 
point of face FAo A4. 

2. The proof that S and T generate the ten rotations connected with any 
of the five vertices Ak shows similarly that they generate the ten rotations 
connected with any of the five vertices of type B and the ten connected with 
F', 

3. Hence S and T generate the 6X4 rotations of order 5 about the 6 
diameters joining pairs of opposite vertices, the 15 rotations of order 2 about 
diameters each of which bisects two edges, and (Ex. 1 ) the 2 X 10 rotations 
of order 3 about diameters each of which passes through the mid points of 
two faces. These with the identity rotation give the 60 rotations of R&q. 

128 . The linear homogeneous icosahedral group Choose 
rectangular coordinate axes so that the positive f-axis extends 
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from 0 to y and the axis of the rotation T lies in the second and 
fourth quadrants of the Jf-plane. By (8), rotation S is 2' = ez, 
one of whose two homogeneous forms is 


(23) 


U = e^u, F - €2 z;, e - 


The vertex Aq is represented by a real negative number n. 
Since S carries A^ to the vertices Ai, A^, As, A4 are repre- 
sented by en, n, i n, n. Since T interchanges Aq and F, 
which are represented by n and co, and interchanges the diamet- 
rically opposite points, represented by m == — and 0, we see 
that 

^ , nz + I 

z — n 

Since T interchanges Ai and A4, z — en implies z' = n. Hence 
^ g oj. q, ^4)2 gijit € + 6^ is positive 

and n is negative. Hence n(€+ = — 1. Applying the powers 

of S {z' ~ €^) to m and 71, we conclude that the 12 vertices of the 
icosahedron are represented by 


(24) 


2: = 0, 00, €* m, €* n 

(fc = 0, 1, 2, 3, 4; m = € + n = + e^). 


Multiply the numerator and denominator of T by ^ we 

get 

(€ — e^)z 4 - 


T : 


z = 


(€^~ e^)z- (€- 6 ^) 
whose determinant is 5. Hence a homogeneous form of T is 


( 25 ) 


5^ {7 = (f — + (e® — e^)Vj 

5iF= (e^- e^)u- (€~ e^)v. 
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As a check, the square of the latter is 17 == — F = ~ t’. 

Theorem 9. The icosahedral group Hm is generated by (23) and 
(25). 

129. The invanants of the icosahedral group H 120 . The form 
which vanishes at the 12 values (24) of 2 ; == u/v is 

/ = uv{u^ — m® v^) {u^ — v^). 


We desire for r = 5 the sum Sr of the rth powers of the roots m 
and n of + x — 1 = 0. Multiply by insert the roots, and 
sum; we get 

Sr + Sr-l — Sr-2 = 0. 

Using this recursion formula, we get 
§5 = — §3 == 2sz — S2 = — 3s 2 “f* 2si == 5si — 6 == — 11. 

Since mn == — 1, we have 

(26) f — uv (u^° + llu^ — t?^°). 

The Hessian of / is 121H, and the Jacobian of / and H is 207, 
where 

(27) H - 2;2o +- 228 - 494^^° 

(28) T=u^^+v^^+b22 {u^ _ ioo05 


A special point of the sphere is defined to be a point which takes 
fewer than 60 positions under the group Eeo* Hence it is a point 
on the axis of one of the rotations other than the identity. The 
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enumeration of these rotations in the preceding Ex. 3 shows that 
every special point belongs to one of the following sets: the 12 
vertices (at which / vanishes), the central projections on the 
sphere of the mid points of the 20 faces, and those of the mid 
points of the 30 edges. An aggregate of two or more sets, not 
necessarily distinct, never gives a set of 20 or 30 points. Hence 
the invariant H vanishes at the central projections on the sphere 
of the mid points of all 20 faces, while the invariant T vanishes 
at those of the mid points of aU 30 edges. 

Evidently / is unaltered by transformation (23). Computation 
shows that / is unaltered by (25), and hence is an absolute in- 
variant of Hm* The same is true of H and T since the trans- 
formations have determinant unity. Proceeding as at the end of 
§124, we obtain 

Theorem 10. Every homogeneous 'polynomial in u and v 'which is 
invariant under the icosahedral group Hm om, absolute invariant 
'which is a polynomial in the invariants /, LT, and T. The latter satisfy 
the identity 

(29) ^ 1728/5 HK 

130. The form problems for the tetrahedral and octahedral 
groups. By §124, every polynomial absolute invariant of the 
homogeneous tetrahedral group H 2 i is a polynomial in the absolute 
invariants t,W= and which satisfy the identity 

(30) 4>3[12(~ 3)^ f 

Given arbitrary values for t, W, compatible with relation (30), 
we seek the sets of values of Uy v. This is the form problem for 
^ 24 - From one set of solutions Uy v, we evidently obtain 24 sets 
by appl 5 dng to it the 24 substitutions of H 24 . That there are ex- 
actly 24 sets of solutions is seen as follows: Write 


(31) 


X{Uy V) = W/ty 
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so that Zi and X are the given values of these absolute invariants. 
The first quotient is a function of 0 = u/v. The corresponding 
equation 

(32) (^4 _ 2 • 3^ iz^ + 1)3 - Zi(^ + 2 • 3^ iz^ + 1)3 = 0 

has 12 solutions z. For each such 2:, we may compute X{z, 1 ) and 
then find two sets of values of v from 

(33) = X(w, v)/X(Zj 1), u = zv. 

The first relation (33) follows from the fact that the degree 8 of 
W exceeds the degree 6 of i by 2. The form problem for H24 there- 
fore reduces to the solution of equation (32) and extraction of 
square roots. 

Equation (32) is invariant under the group Fn of all tetrahedral 
linear fractional transformations on 2. Four of them (cf. Ex. 3, 
§123) are 2;' == ± 2:, 2' = ± 1/2, and these leave invariant the 
function 

(34) Z 2 = - 1)V(4^'). 

Employ the cube root w = |(— 1 + 3H) of unity. Then 
Z 2 ~ CO = (^^ - 2 * 3i iz^ + 1)7(422), 
while Z2 — co^ is obtained by changing i to — i. Hence 



Hence (32) may be solved by extracting a cube root (35) to get 
Z2, solving the quadratic equation (34) in 2^, and extracting the 
square root of the resulting value of 2^. 
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For the form problem of the octahedral group His, we are given 
arbitrary values of the absolute invariants W, tx, compatible 
with the relation (22) or 

- w^) + (t xy « 0 , 

and seek the 48 sets of values of u, v. Write 

(36) Z = Xiu, v) = W t/x. 

Since X is of degree 2 in m, u, we again have (33). For Zi = 
we see from (30) and W = that 

(37) - 3 • lUZi/iZi - ly = Z. 

The first equation (36) is of degree 24 in the unknown z. Its 
solution is equivalent to the solution of the quadratic equation 

(37) for Zi and the earlier equations for finding z from Zi. 

Thbobem 11. The form problems for ihe tetrahedral and octa- 
hedral groups are solvable by radicals. 

131. The form problem for the icosahedral group H 120 . Given 
arbitrary values of the absolute invariants/, E, and T, compatible 
with relation (29), we seek the 120 sets of values of u, v. Write 

(38) Z= f/'P, X (w, v)=f H/T. 

Since X is of degree 2 in u, v, we again have (33). Hence the form 
problem for Em reduces essentially to the solution oi f — Z P 
= 0, which after division by 1 )®“ becomes 

2 ®(z“ + Hz® - l)s — Z[z®» + 1 + 522(z“ - z®) 

- 10005 (z“ + z“)? = 0. 


( 39 ) 
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This equation of degree 60, having the single parameter Z, is 
called the icosahedral equation. When Z is arbitrary this equation 
will be shown to be not solvable by radicals, in contrast with the 
form problems for the tetrahedral and octahedral groups. Our 
first step wiU be to reduce the solution of (39) to that of a remark- 
able quintic equation. 

In (26) we gave the function / which vanishes for the twelve 
values of = u/v which represent the vertices of the icosahedron 
J. By inspection, / is unaltered by the transformation 

(40) U -= -V, V =^u; 2' - - lA. 

Hence the corresponding rotation F leaves I unaltered, so that 

(40) belongs to the group Hm- In §§127-8, we defined the rota- 
tion T through 180° about the axis OP in the ^f-plane, where P 
is the mid point of the edge Aq V. By Ex. 3, §123, F is the rota- 
tion through 180° about the ij-axis, and the product TF is the 
rotation through 180° about the axis which is a common per- 
pendicular to Otj and OP, 

To find the points unaltered by (40), take = z, whence 
1 = 0, To find the points unaltered by T, write (25) in the 
non-homogeneous form and take z' = z. Similarly for TF, In 
the resulting three equations, we replace z by u/v. Hence the 
pairs of points unaltered by F, T, TF are respectively those for 
which 

Aq = + v^, Bq — — 2{e^ + ^)uv — 

Co = 2(6 + e'^)uv — 

vanish. Hence the points for which 

(41) to = Ao Bo Co = u^ + V — — 2ut^ + 

vanishes correspond to the vertices of a regular octahedron. Thus 
to is the fundamental invariant of the octahedral group referred 
to axes different from those used in §126. By (23), we have 
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( 42 ) S^: 

This replaces Ao, Ro, Cq by respectively 
Ak == V^, Bk = — 2{€^ + e^)uv — v^, 

Ck — — 2(€ + e'^)uv — z;\ 

Write tk — AkBkCk- For T defined by (25), 

UV = ^l{u^ + UV - v^), Z = (€3 - e^) (e - €^) = 

We readily verify that T replaces Ai by mB 2 j where 

m = |(2€-“ €^+ 2€^), 

Cl by (€^ — €^)A 2 , and Ci — Si = — 2luv by 

— 2{y? -{■ UV — v^) = (e^ — €®)A2 — (e^ + €^)C2. 

Hence T replaces Bi by (e^ + €^)C 2 . Since m{e^ — e^){e^ + e^) 
= 1, r replaces ii = Ai Bi Ci by fe. This implies that T replaces 
ti by tz since A^ki B^k, C-k are derived from A a;, Bk, Ck by replac- 
ing € by €~S which merely changes the signs of aU coefficients of 
r, a change not affecting quadratic functions of u, v. 

Since T permutes the end points of the axes of F and TF and 
leaves those of T unaltered, T changes Ao and Co into their nega- 
tives and leaves Bq unaltered. Hence, since T is of order 2, 

(43) T = {to) (^l U) (}z U)j S = (to ti 4 tz ti). 

Theobem 12. The icosahedral group of 60 rotations or of 60 
linear fractional transformations is simply isomorphic to the alter- 
nating group on five letters. 
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This may be seen geometrically. The 15 diameters each of which bisects 
two edges of the icosahedron fall into 5 triples each of three mutually per- 
pendicular diameters (like the axes of F, T, TF), Each triple can be converted 
into aU the remaining triples by rotations of Rm. Hence exactly 60/5 or 12 
rotations of Feo leave a given triple unaltered. The latter form a subgroup of 
the octahedral group E 24 , whose rotations of order 4 are not in Feo. Hence 
this subgroup is a tetrahedral group Rn From (43 ) it is easy to pick out the 
12 substitutions on ^ 0 , , ^4 which leave to unaltered and hence correspond 
to the subgroup Rn of Feo. For example, STS^ — (0)(2)(143), T, and 
F = TS^ TS^ T = (0) (14) (23) generate the homogeneous form H24 of Eiz, 

By (43) every transformation of the icosahedral group ffi2o 
permutes U, , . , amongst themselves and hence leaves unaltered 
their elementary symmetric functions. Thus . . • ? 4 are the 
roots of a quintic equation 

4” Cl “h C2 C3 C4 1/ “h C5 = 0, 

where is a polynomial of degree 6& in v which is an invariant 
of Hm- The largest is 30 and hence is < 60. Thus by §129, 
c ft is a product of factors /, H, T, whose degrees are 12, 20, 30. 
Since no such product is of degree 6 or 18, we have Ci = C3 = 0. 
Also, C2 = a/, C4 = C5 = cT, where a, 6, c are constants. 
Let Sk be the sum of the fcth powers of the roots. Since ci = Cs = 0, 
two of Newton’s identities reduce to + 2c2 = 0, §4 + C2 S2 
+ 4 c 4 == 0. Thus 

S 2 + 2a/ = 0, §4 + (46 — 2a^)P = 0, 

cT “b ^0 k iz — 0. 

in (42) replaces U by tkj we have 

tk = w® + 2€^* — 5 €* + • . . , 


(44) 

Since 

(45) 
whence 


th = + 4z^^^ z; — 6 + . . . , 

+ 8 V + 4:Vp^ + . . . . 
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The coefficients of Vj vF in the respective functions 

(44) are 

20 + 2a = 0, 20 + 46 - 2a2 = 0, c + 1 == 0, 

whence a = — 10, 6 == 45, c = — 1. This proves 

Theoeem 13. The octahedral forms fo, . . . , ^4 cire the roots of 

(46) t? - lOft^ + 45/2 i - r = 0. 

This equation was first obtained by Brioschi ( §120). 
Introduce in place of the root t of (46) the new unknown 
y) = tpfTf which is of degree zero in u and v and hence is a func- 
tion of z. In (46), replace t hy w T/Pj multiply all terms by 
P^/T^, and write Z iovf/T^, in accord with (38). We get 

(47) - lOZw^ + 45^2^4? - ^2 == 0. 

132 . The principal qtdntic resolvent of the icosahedral equa- 
tion. We employ the functions tk defined by (41) and (45). The 
Hessian of i jb is 400 W h, where 

Wic = — V — 7 € 2 * 

-7^^ - €2^ v\ 

Since the generators T and S of the icosahedral group Hm are 
of determinant unity and permute ^o, • . . ? ^4 according to the 
substitutions (43), they permute their Hessians and hence 
TFo, . . . , W4 according to the same substitutions. Thus 
^tk^Wk^ is unaltered by T and S and is therefore an invariant 
of Hi 20 j and hence is a polynomial in /, H, T, whose degrees are 
12, 20, 30, respectively. No linear combination of the latter with 
integral coefficients ^ 0 reduces to 8, 14, 16, 22, or 28. Hence 
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Z ^ = 0. E = 0, 2^ IF* = 0, E = 0, E = 0- 

Thus E y = 0. S == 0, if 

(49) F* = <rFfc + r h TFj (fc = 0, 1, 2, 3, 4). 

Hence the latter are the roots of a principal quintic 

(50) P + 5aF2 + 66F + c = 0. 

By Newton’s identities, we have 2^ ~ 15o, 23 

= — 206. An inspection of the degrees of the invariants yields 
the following identities except as to the prefixed numerical factors 
which were found by a comparison of single terms of each member: 

TF3 _ 5 . 24/2, ^ = _ 57 ^ 

J^fW^=-5-72f, = - 15/7’, 

23 TF« = 20fH, 2 <2 IF* = - 60f H, 

2 f IF" = - 5HT, - 540/® H. 

Also 23 — 0. Hence 

a = 8f <7® + 7’(r® r + 72/® (TT® +/rr®, 

6 = - fHa* + 18/® Hv® t® + HTa-r^ + 27/® Ft*. 

To compute c, note that (46) gives 

3^ — lO/r® + 45/® a; — 7’ = H (r — tk), 

identically in x. For a; = — v/t, this gives 

II(v + Th) = <7® - 10/<r® T® + 45/® err* + Tt®. 
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But HW h — — the numerical factor being found by com- 
paring one term of each member. Hence 

c = - H ^2 + 45^2 ^^4 + 

To obtain a resolvent of the icosahedral equation (39), we pass 
to a quintic equation which involves /, i?, T explicitly only in the 
combinations 

z = V - 

where 

( 61 ) 2-1 + 7 = 1728 


by the identity (29). To this end, write 


(52) 

^ = U/H, 

r = y^P/m)^ 

Then 




\Va = 8X3+ XV 

+ (72Xm3 + M®)Z, 

(53) 

^ F6 = _ X4 + 18XV" 2 + Xm* Z + 27m" 


lYc = - 10\^ Z + 4.5XfjL^ Z\ 

When these values of a, 6, c are inserted into (50), we obtain the 
principal resolvent of the icosahedral equation. 

133. Identification of any principal quintic with the principal 
resolvent* Given a, 6, c, we shall prove that equations (51) and 
(53) can be satisfied by choice of X, Z, V. To the third equa- 
tion (53) add the product of the second by X; we get the product 
of the first by Z, whence 


(54) 


X6 + c = Za. 
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Multiply the third equation (53) by X and the second by Z 
and subtract; we get 

(55) V{\c - y? Zb) = (X2 - Z)^ 

From the first two equations (53), we get 

V (\a + 85)/m = X^ + 216X2 /mZ + 9Xm" Z + 216^' Z\ 

Divide the square of this by Z and subtract the result from 27 
times the square of the first equation (53), and simplify by use of 
(51); we get V (X^ — 3^^ Z)^. Employing (55), we get 


27a2 ~ 


(Xa + 85)2 

fi^Z 


Xc — jj} Zb. 


Elimination of iJ' Z by means of (54) gives 


(56) 


X2(a^ + abc — ¥) — X(lla^ 6 — ac2 + 262 c) 

+ 64a2 62 - 27 c - hc^ -- 0. 


After finding a root X of this quadratic, we deduce Z from (54), 
V from (55), and then Z from (51). By means of the resulting 
value of y?' and the first equation (53), we can find y rationally. 

The root X involves the discriminant A of (56). Let P denote 
the product of the squares of the differences of the roots of 
giY) = 0, defined by (50). It is known^ that P is the resultant 
of g{Y) and its derivative, whence P 5^ Dj where 

D = 108a5 c - 135a4 ¥ + 90a2 5^2 - 320a63 c + 2566^ + 

We find that A = aP D. In the theory of invariants, D (and not P) 
is called the discriminant of (50). 


^ Dickson’s First Course in the Theory of Equations, 1922, 152. 
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Hence equations (53) and (51) may be solved rationally for 
X, My -Z', V in terms of a, 6, c, DL 

Theokem 14. A general principal quintic equation may he identi- 
fied with the principal resolvent of the icosahedral equation. The only 
irrationality introduced in making this identification is the square 
root of the discriminant of the principal quintic. 

134. General quintic equation. By means of a transformation 
involving a square root, we may reduce the general quintic to a 
principal quintic ( §118). We just proved that the latter may be 
identified with the principal resolvent in §132 in which X, /Xy Z, V 
are constants; its roots are 


(57) 


\fW k pPtk Wk 
H HT 


(fc = 0, . . . , 4), 


which are of degree zero in u and v and hence are poljmomials in 
z = u/v. Hence the general quintic equation can be solved in 
terms of two square roots and a root z of the icosahedral equation 
(39). 

The latter can be solved in terms of eUiptic modular functions.^ 
If with Klein we regard as known a root 0 = <^(Z) of the icosa- 
hedral equation, we may compute the roots of the general quintic. 

135. Tschimhaus transformation of the special Brioschi re- 
solvent (47) into the principal resolvent of the icosahedral equa- 
tion. For the system of coordinates used in §§123-6, the values 
ol u IV for which the Hessian of the octahedral form t vanishes 
correspond to points on the sphere which are on axes of rotations 
of order 3. Since the same is therefore true for the system of 
coordinates in §§128, 131-2, the Hessian W of the new form t 

1 Klein, Math. Annalen, 14, 1879, 157-8. Bianchi, iUd., 17, 1880-1, 254-7. 
Klein, Ikosaeder, 1884, 130-6; Theorie der EUiptischen ModuKunctionen, I, 1890, 
125 ; 
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vanishes for values of z corresponding to mid points of faces of the 
icosahedron. Hence TF is a factor of H. Another factor is — 3/. 
For, if we transpose the term T of (46), square and replace 
by 3/, we get 3/(24f )2 = I728f == T% whence H = 0 by (29). 
A comparison of the coefficients of one term of each member gives 


( 68 ) 

From (57) with the 
and Z = we get 

(59) 


H s W{t^ - 3/), 

subscripts k suppressed, (58), t — w T/P^ 

Y = ^ 

Z-^ - 3* 


Theorem 15. The Tschirnhaus transformation (59) replaces 
resolvent (47) hy the principal resolvent of §132. 

Since (47) is a quintic well adapted for solution by elliptic 
functions, we may so solve the principal resolvent and hence any 
quintic. 

Exercise 

Give a direct proof that (59) transforms (47) into the principal resolvent. 
Apply Sylvester^s method to eliminate using (47) and its product by v), 
and the quadratic equation in w equivalent to (59) and its products by w, 
1 ^ 2 , w^j w*. Equating to zero the determinant of these seven linear equations 
in tr®, . . . , tt;, 1, show that we get (50), where a, 5, c are given by (53) and 
(51). 

136. The Galois group of the icosahedral equation. Let Fm de- 
note the group of the transformations 

z' ^Li{z) (i = l, ...,60) 

which are the linear fractional forms of the homogeneous trans- 
formations of the icosahedral group. In view of the generators 
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(23) and (25) of the latter, the Li(z) involve the single irra- 
tionality e. Let I (z) denote the left member of the icosahedral 
equation (39). Its coefficients belong to the field F — R{Z, e) 
and are invariant under the group Fgo. Hence if is any root of 
I (z) = 0, all its roots are given by Zi = Li(zi) for f = 1, . . . , 60. 
Thus any factor of I ( 2 ) whose coefficients are in F vanishes for all 
60 roots, whence I (z) is irreducible in F. 

Since Fm is a group, L^[L; ( 2 )] ^ Lh{z). Hence the replacement 
of Zi by Zi gives rise to a substitution Sj on the 60 roots 

We shall prove that the set of these 60 substitutions Sj has the 
characteristic properties A and B of §89 of the Galois group of 

I (z) = 0 for F, Let / {zi, , ^eo) be any rational function of the 

Zi with coefficients in F, When each Zt is replaced by Liizi), let 
/become (pizi), whose coefficients belong to F, 

First, let / be unaltered in value by each of the substitutions Sj, 
Then (^>( 2 : 1 ), . . . , <l>(zeQ) are all equal. Their sum, which is equal 
to 60/, is a symmetric function, with coefficients in F, of the roots 
of I ( 2 ) = 0. Hence / is equal to a quantity in F. 

Second, let / be equal to a quantity q in F, Then = g is 
an equation with coefficients in F which is satisfied by one root Zx 
of the equation I ( 2 ) = 0, irreducible in F, and hence ( §80) by 
aU its roots Zj, In other words, the <t>izj) are all equal, so that/ 
is unaltered in value by all the substitutions >S,-. 

Theorem 16. The Galois group G of the icosahedral equation 
(39) for the field iJ(Z, e) is simply isomorphic to the group Fqo of 
the linear fractional icosahedral transformations. 

Hence, by Theorem 12, G is simply isomorphic to the alternating 
group on five letters. The icosahedral equation is therefore not 
solvable by radicals ( §111). 

Theorem 17. The solution of either the icosahedral equation or its 
resolvent quintic equation (47) is equivalmt to the solution of the 
other. 
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For, if we have found a root z of (39), we may compute at once 
the rational functions wj, = Up/T of and hence have the roots 
of (47). Conversely, if we have found the roots Wh oi (47), we 
may compute ^ and hence all the roots of (39) as rational functions 
of the Wki Zy €. For by Theorem 12, the identity is the only trans- 
formation of F 60 which leaves unaltered each th and hence each 
Wk. whence the identity is the only substitution of G which leaves 
each Wk unaltered. Hence the adjunction of all the wi to the field 
F reduces G to the identity group, so that each root of (39) is in 
the enlarged field. 


Exercises 

1. The solution of the principal resolvent of the icosahedral equation is 
equivalent to the solution of the latter. Then by Theorem 17, the roots of the 
principal resolvent are rational functions of Z and the roots of resolvent (47), 
and conversely. This would have enabled us to predict the existence of a 
Tschirnhaus transformation which replaces (47) by the principal resolvent 
(§135). 

2. The icosahedral equation is its own Galois resolvent. 

3. For a field containing h and (- 3)1, the tetrahedral equation (32) is 
its own Galois resolvent, and its Galois group is simply isomorphic to F 12 
and hence to the alternating group on four letters. Thus (32) is solvable by 
radicals. 


137. Further results. The solution of the general quintic may 
be reduced to the form problem of a group of 60 linear transforma- 
tions on three variables^ The general sextic has been reduced to 
the form problem of Valentiner’s group of 360 ternary linear trans- 
formations which is simply isomorphic to the alternating group on 
six letters.^ The general equation of degree 7 reduces to the form 
problem of a linear group of order 1 71 on 4 variables.^ 

^ Klein, Ikosaeder, 1884, 211-60. 

^ Klein, Math. Annalen, 61, 1905, 50-76 (reprinted from Jour, fur Math., 129, 
1905, 151-74) ; Math- Abhandl., II, 1922, 481-502. 

® Klein, Math. Annalen, 28, 1887, 499-532; Math. Abhandl., II, 1922, 439-72. 
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The symmetric group on n letters can be represented as a linear 
homogeneous group on variables subject to the relation 

^Xi=^ 0(cL §140). But^ for > 7, neither the symmetric 
nor the alternating group on n letters is simply isomorphic to a 
linear group on n — 2 or fewer variables. In other words the 
general equation of degree n ^ 8 can be reduced to a form prob- 
lem of order n — 1, but not lower. For a clear summary of these 
and related topics, see Wiman’s article^ in Encyklopadie Math, 
Wiss,, I, 1, 1904, pp. 522-54. 

^Wiman, Math. Annalen, 52, 1899, 243-70. 

® Later literature. E. H. Moore, Math. Annalen, 51, 1898-9, 417-44. Coble, 
ibid., 70, 1911, 337-50; Trans. Amer. Math. Soc., 9, 1908, 396-424; 12, 1911, 311- 
25. Dickson, ihd., 9, 1908, 121-48; 12, 1911, 75-97. Speiser, Math. Annalen, 77, 
1916, 546-62. Schur, Berlin. Berichte, 1908, 664-78; Jour, fur Math., 127, 1904, 
20; 132, 1907, 85; 139, 1911, 155-64 (cf. de S6guier, Jour, de Math., (6), 6, 1910, 
387-436; 7, 1911, 113-21). Weber, Algebra, ed. 2, 1899, II, 228-301, 373-89, 
470-550. 
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REPRESENTATIONS OF A FINITE GROUP AS A 
LINEAR GROUP; GROUP CHARACTERS 

138. Introduction. As a sequel to the preceding chapter, we 
shall now prove some remarkable theorems on the representations 
of a given group as a linear group, and also give an introduction to 
the theory of group characters. The latter theory is an effective 
tool for finite groups and has led to theorems not proved other- 
wise, such as the fact that every group of order is solvable 
if p and q are primes, and that every transitive group of prime 
degree is either metacychc or doubly transitive. 

Starting with a simple example due to Dedekind, Frobenius 
developed the theory in a series of complicated memoirs.^ We 
shall follow the simpler exposition given by Schur^ partly because 
it is unusually attractive and partly since alternative introduc- 
tions^ to the subject are already available in English.^ 

139. Reducible linear groups. Let a finite number g of Tz-rowed 
non-singular matrices R form a group such that there exists a 
non-singular n-rowed matrix P for which 

1 Berlin. Berichte, 1896-1903. The author gave an elementary expositioii of 
Frobenius’s theory in Annals of Math., 4, 1902, 25-49, also an extension to groups 
of transformations modulo p, Trans. Amer. Math. Soc., 8, 1907, 389-98; Bull. 
Amer. Math. Soc., 13, 1906-7, 477-88. 

2 Berlin. Berichte, 1905, 406-32. Hypercomplex numbers were employed in the 
treatment by MoHen, Sitzungsber. Naturf. GeseU. Dorpat, 11, 1896, 259-88. 

^Blichfeldt, Finite CoUineation Groups, University of Chicago Tress, 1917, 
116-38. Miller, Blichfeldt, and Dickson, Finite Groups, 257-78. Burnside, 
Theory of Groups, ed. 2, 1911, 243-371, 464-84, 499. 

* The author added §146 and the examples. 
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for all g matrices R. Here and are square matrices with a 
and d rows respectively, Cr has d rows and a columns, while all 
elements of matrix 0 are zero. Then the group is called reducible. 

Theoeem 1. The matrices of a reducible group can be trans- 
formed simultaneously into^ 



With R and 5, the given group contains RS. Thus 

Mrs = P-l RSP = P-^ RP • P-^ SP = Mr Ms, 


whence 


(1) 

■^RS — -^RS — 

(2) 

Cbs == C^R + -Dr Cq. 


Multiply the members of the last equation on the right by 
As“i and sum for the g values of S. Since Ag Ag-i = Aj^ is the 
a-rowed identity matrix, we get 

2D ^RS ~ "p } P == -^ ^ Cq Ag-U 

s g s 

In the first sum replace S by S. Using (1), we get 

'£CsA,-.A^^gFA^. 

S 

Suppressing the common factor Qj we get 


^ Masclike, Math. Annaien, 52, 1899, 363. 
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(3) FA^==C^ + D^F. 

Consider the matrix of determinant unity 



By (3), Q = where is given in the theorem. Hence 
PQ is non-singular and transforms the given group into the group 
of matrices 


140. Representations of a group as a linear group, group matrix. 
Consider a group of substitutions on n letters. Let any two of its 
substitutions be 



m - ' y\nP Vfi f2 



To jB make correspond the linear transformation 


r : = “J?!, • . • , 0n = 'nn- 

To S corresponds s: ^71 == fi, . . . , i?n = fn. Hence to RS corre- 
sponds rs: = Tij - • • > fn = fn, if we define the product rs as 
in §22. 

Theorem 2. Any substitution group on n letters is simply iso- 
morphic to a linear group on n variables, 

A substitution group may be simply or multiply isomorphic to 
various linear groups. 

Let §1, ... y Si, be the substitutions of a group G (or elements 
of an abstract group G), whose identity is si. For i = 1, , . . , gr, 
let M Si be an /-rowed square matrix such that 
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(4) Ms.Msj === Msis, (hj = I, 

and sucli^ that not every Ms^ is singular. By (4) for j = 1, we 
have MsiiMsi — I) =0, whence M^i = I, the /-rowed identity 
matrix. Since Msi Af, = Ms^ = I, every M^i is non-singular. 

Then the set of linear transformations having the matrices 
Msi, . . - , Ms^ is called a representation of G as a linear group. 
When the Af^s are not all distinct, the linear group contains 
duplicate transformations and is multiply isomorphic to G. 

To enable us to treat all the Af’s Simultaneously, we introduce 
g independent variables Xsi(i = 1, . . . , and call 

(5) X = i M.. X., 

t-1 

the group matrix corresponding to the representation of G as the 
linear group {Af«i, . . . , 

For example, let Si = 1, §2 = (12) (34), S3 = (13) (24), 
Si = (14) (23). A group matrix is 

Xi X2 Xs X4 
X2 Xi X4 X3 
X3 X4 Xi X2 
X4 X3 Xo Xi 

Let T denote the transformation, of matrix X, which expresses 
, ?4 linearly in terms of 771, , 774. Introduce the new 
variables 

^3^ = ^2 + & + ^4, ^2 == + fe — ?4, 

fa = ?1 — & + — ?4j f4 = “ & — & + 

= *^1 + ^2 + ^?3 + ^4j 0^2 = ^7l 4“ ^2 “ 'jys ““ ‘»74> 

a>3 = 771 — 772 + 773 — 774, <^4 “ ^1 ‘*72 + i?4* 



1 This assumption, not made by Schur, simplifies some proofs. 
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Then T becomes 


fl = (Xl + 0^2 + iCs + 
f 2 = (Xi + X2 — Xz — X4.)o)2, 
fa == (Xl — a>2 + XS — X4:)0)3y 
?4 = (Xi - X2 - Xz + Xa)o}a- 

141. Irreducible group matrices. If X is a group matrix and 
P is a constant matrix whose determinant is not zero, then 
P~^ XP is a group matrix called equivalent to X. 

Lemma, If X and Y are two irreducible group matrices of orders 
f and h for which XP = PY, where P is a constant matrix with f 
rows and h columns, then either P — 0, or else f = h and the deter-- 
minant of P is not zero, so that X and Y are equivalent. 

Suppose that P 9 ^ 0. Then P is of rank r > 0. Write s = / — r, 
t = h -- r. By §30, there exist non-singular matrices A and B 
with / and h rows, respectively, such that 


Q-APB- 



where It is the r-rowed identity matrix, while Zrf is the zero matrix 
with r rows and t columns. Write 

AXA-^ = Xl = J'*), 

where Xrt has r rows and t columns, etc. We have 


Zi Q = AXA-^ Q = AXPB = APYB = APBYx = QYi. 
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Hence 


I;:) 


'Yrr 

^Zsr 



whence Xsr = 0, Fri = 0. If either s > 0 or ^ > 0, X or F would 
be reducible, contrary to hypothesis. Hence — Q,T=f=h, 
so that Q = If and P = A~^ is a square matrix whose 
determinant is not zero. 


Corollary. Every constant matrix M commutative with an 
irreducible group matrix X of order f is of the form clf. 

For, if c is a root of \M — z If \ — Oj P = M — clf is a con- 
stant matrix whose determinant is zero such that XP = PX, 
whence P = 0. 

If P is any one of the elements Si, ... ,Sg of a group (?, the 
corresponding /-rowed matrix (previously denoted by M^) of 
a representation of (? as a linear group will be designated by 
(cLijB.). In a second representation of (?, let R correspond to the 
A-rowed matrix = (&pgR). These notations are employed in 
the proof of the following 

Theorem 3. Consider any irreducible group matrix 


X = (x.,) 

(i, J = 1 , • 


of order f. The coefficients of 




(R — Sif . . 

• ) 

satisfy the relations 



^0 g 

(fi) X/ = 7 ^il ^ jh 

J 

(i) j) k, 1 = 

1, •••,/), 

where da = 1, d,j = 0(i ^ /). If 



Y = (y,,) 

( p,q = l ,.. 

.,h) 



§141] 


IRUEDUCIBLE GROUP MATRICES 


257 


is a second irreducible group matrix of order h belonging to the same 
group Gy which is not equivalent to Xy and if 


Vpa “ S % 


then 


(i? — Sly j Sg^y 


(7) Z = 0 (tiz]’"' ’{)■ 

it~s ^ Ij * • • ) h' 


For any two elements R and S of ff, we have 


Let U — (utj) be an /-rowed square matrix whose elements 
u ^3 are arbitrary constants, and write 


(8) V=^A^-.UA^. 

R=»«2 

Then 

As-iFAs 


The second member is equal to V since RS ranges over the ele- 
ments of the group G when R ranges over them. Since A^-i Ag 
— Aj = I/, we have VAq = Ag V. Hence V is commutative 
with every Ag and hence with X. By the Corollary, F = tJ Jy. 
By (8), t; is a hnear homogeneous function of the w’s: 


J, fc 


^ V O'ijR ^njkajolR — ^ilVy 
R j, k 


Hence by (8), 
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for every i, Z, and for all values oiu^k- Hence 

(Q) Ui j Q/ki u ^ d ii c jk (j'j jj ^ ^ } /)• 

Take I = i and sum for ^ = 1, we get 

EE CtfctR Ui/R-1 — fCjk* 

R z 


But Ar Ar -1 = Aj = 1/ implies 


ci'kin dijn-i = djfej. 


Summing this tor R — Si, Sg, we obtain the left member of 
the preceding equation. Hence fcfk = Inserting into (9) 

the resulting value of Cj*, we get (6). 

Let TV = (wtp) he s. matrix with / rows and h columns whose 
elements are arbitrary constants, and write 

2= ZAk-iTFBr. 

R 

As before, Ag-i ZB^ — Z, whence ZY = XZ, Since X and Y are 
not equivalent, the Lemma shows that Z = 0. Hence 

2-/ 2-) ip dpg-R = 0 

R y, p 

Since the Wjp are arbitrary, we have (7). 

Theoeem 4. If X ^ {xij), Y = {ypg)j ... are any irreducible 
group matrices of orders /, h, . . . belonging to the same group 
G = {si, . . . , sv^h that no two of the matrices are equivalent, 
then thep + h^ + • * ♦ homogeneous linear functions y^^, .. .of 
the variables are linearly independent. 
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For, if the c, . are constants such that 

^ ^ Ctj "4“ dpq y^q *4“ • • * 0, 

/ P, 5 

then in the notations of Theorem 3, 

"t" dpq hpqj^ 4“ * • * =0. 

3 P, Q 

Multiply by ajcm~^ and sum for i2 = Si, . . . , Sg, By (6) and (7), 
we get 

dii d 2k ” 0. 

This sum evidently reduces to cn- A similar proof gives d^q = 0. 

Theorem 5. The determinant of an irreducible group matrix is 
an irreducible function of the variables Xa^. Two irreducible group 
matrices are equivalent if and only if their determinants are identical. 

For, if X = is an irreducible group matrix of order /, the 
Xij are P linearly independent functions of the x^ (Theorem 4). 
Hence if the Uij denote P independent variables, the p equations 
Xii = u^ 3 ' determine P of the as homogeneous linear functions 
of the Uij and the remaining x^ ( §32). If the determinant | aji,* [ 
were a product of two polynomials in the x^y then after assigning 
the value zero to each of those g — P remaining x^, we obtain a 
decomposition of d = \uij'\ into two polynomials v and w in the 
Utj. Since d is of degree 1 in each variable Uj we may assume that 
V is of degree 0 and w of degree 1 in Un- No term of the expansion 
of the determinant d contains the product of by an element 
Uri of the first column. Hence v is of degree 0 in Uri, whence w is 
of degree 1 in it. Since UreUri is not a term oivw== d, y is of de- 
gree 0 in every Urc- Hence a determinant whose P elements are 
independent variables is irreducible. This completes the proof of 
the first part of Theorem 5. 
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If X = (Xii) and Y = are irreducible group matrices 

whicli are not equivalent, Theorem 4 shows that we can choose 
the Xr. so that the Xij and the y^q take any assigned values. Hence 
the determinants of X and Y are not identical. 

142. Reducible group matrices. By repeated applications of 
Theorem 1, we see that a reducible group matrix X is equivalent 

ZiO ••■00' 

0 X2 •••0 0 


0 0 -**0 

where each Yi is an irreducible group matrix. 

Theoeem 6. Every group matrix is equivalent to one of type Y 
in (10). 

Theoeem^ 7. If a group matrix X is equivalent to both Y and Z 
in (10), where also each Ziis irreducible, then m == n and Yi, , . . , 
Ym are equivalent in some order to Zi, , Zm- 

For, let dt and Di be the determinants of F i and Z^, respectively. 
By Theorem 5, di and Di are irreducible. Since 

dida dm = Di D2 • * • Dn, 

and since a polynomial is decomposable into irreducible polyno- 
mials in a single way, apart from the arrangement of the factors, 
we have m = n and see that di, dm are the products of 
Di , . . . , Dm in some order by constants. But for = 1 , 

= 0(jB 7 *^ /), the determinant of a group matrix reduces to unity. 
Hence the constant factors are all unity. By the last part of 
Theorem 5, matrices Fi, , . . , F^ are equivalent in some order to 
Zl, . . . , Zm- 

^ For a more general such theorem, see Loe^, Trans. Amer. Math. Soo., 4, 1903, 
44 . 


to a group matrix F in 



'Fi 0 • 

■ 0 

0 

0 

11 

0 72 ■ 

• •0 

0 


0 0 

•••0 

Y, 
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In view of Theorem 7, we shall call the group matrices Fi, . . , , 
Ym the irreducible components of the group matrix X, If two of 
them are equivalent they may be taken to be equal. Then if Yi 
occurs exactly times, we call Vi the index of Yi, and the set of 
r's the indices of F or X. 

Theorem 8. If X = ( Xi ,) is a group matrix with the indices 
ri, . . . , rz and if f ij ••• ,fi are the orders of its irreducible compo- 
nents, then exactly fi^ + fi^ of the functions Xi, of the variables 

are linearly independent Also r^ • • • + rz^ is equal to the 
number v of linearly independent constant matrices P which are 
commutative with X. Finally, I is equal to the number w of linearly 
independent constant matrices which are commutative with X and 
with every P. 

For, let Fi, . . . , Fm be the irreducible components of X, so 
that X is equivalent to F in (10). The first part of our theorem 
follows from Theorem 4. To prove the last two parts, we may as- 
sume that the F^ are chosen so that the first ri are equal, the 
following 7*2 are equal, etc. Then let Yi be of order ti. 

The numbers v and w remain unchanged if we replace X by F. 
Let P be a constant matrix commutative with F. We may write 


Pu 

Pn 

• • • Plm 

P ml 

P m2 

* * * P mm 


where the matrix Pi, has t^TowB and t, columns. Then FP = PY 
if and only if 

Y^ Pij — Pij Y j (i, j = 1, - . * , m). 

If Fi and F/ are not equal and hence not equivalent, then 
Pij = 0 by the Lemma. If they are equal, the corollary in §141 
shows that Pi, = c Fi, where Pi = 7 a is the identity matrix of 
order ti = t,. Hence 
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Cn Pi * * 

• CujFi 

0 

• • • • 0 

Cril Fi ■ ■ 

‘ * C r rx Pi 

0 ■ 

.. 0 

0 • 

0 

dll Pri+1 ’ 

dir^ Prx+1 

0 ... 

.... 0 

df2i P **1+1 



Conversely, its origin shows that every such matrix P is com- 
mutative with X. The number of its arbitrary parameters ca, 
djcnj . . . is ?; = ri^ + ^ 2 ^ + • * • + 

Finally, let Q be a constant matrix commutative with X and 
with every P. Since Q is commutative with X, it may be derived 
from P by replacing the c, d, . . . by C, P, . . . . Since Q shall be 
commutative with every P, matrix (Cij) must be commutative 
with every matrix (ci,), and {Dkn) with every (dkn), etc. Hence 
(Cij)j (Pfcn), . . . are products of identity matrices of orders 
n, ^ 2 , . . . by constants. These are also sufficient conditions that 
Q be commutative with X and with every P. The number w of 
arbitrary parameters in Q is evidently I 

143. Regular group matrix. With any group G = {si, . . . , 
is associated a p-rowed square matrix X whose element in the ith. 
row and jth. column is x with the subscript Hence X 

= 21 summed for aU the elements 5 of G, where Ms denotes 
the matrix whose element aj in the ith row and yth column is 1 
or 0 according as s^ is or is not equal to 5. Similarly, M t is the 
matrix whose element in the jth row and fcth column is 1 or 0 
according as is or is not equal to t In the product Ms ikf f, 
the element in the ^‘th row and Mh column is tt** = (TijTjk* 
Unless the product ar is zero, it is 1 and 

Sisr^ = s, SjSjT^ = tj 

whence = si. Hence if the latter equation does not hold, 

then TTi* = 0. If it holds, the single term oi it ih having a value 
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0 is that for which j is determined by Sj = tsk, and then 
TTik = 1. Hence Ms Mi = Mst^ 

If Si is the identity of G, evidently Ms^ is the ^-rowed identity 
matrix. This completes the proof of 

Theobem 9. X is a group matrix corresponding to G. 

It is called the regular group matrix. When G is a regular sub- 
stitution group ( §90), the method given at the beginning of §140 
to represent G as a linear group leads to our present matrices 

Theobem 10. Arrange the irreducible components Xi of the 
regular group matrix X so that no two of Xi, ... y Xc are equivalent, 
while every further component is equivalent to one of these c com- 
ponents, If Xi is of order fi and index Ci (so that Xi is one of exactly 
Ci equivalent components of X), thmf^ = eifor every i, and c is the 
number of classes of conjugate elements of the group G of order g. 

Counting the number of rows in two ways, we have 

^i/i + * * • + 6c/c = 

Since each element of X is one of the g variables x, the first part of 
Theorem 8 gives 

+ * * • + /c^ = 

Let F be a matrix independent of the x^s. Let [si, S;] denote the 
element in the ith row and jth column of Y. By considering the 
element in the ith row and fcth column of XY and that of YX, 
we see that X and Y are commutative if and only if 

a;fc—l (i, fc = 1, . . . , g) , 

3 t 

identically in the x’s. The two indicated x's have equal subscripts 
if St = Si Sj'"^ sjc- By their coefficients, 


[^3j Sjfcl [Stj (i) i !>•••; fif)- 
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Defirie Sn by Si = $n S/. Then 

[Sj, Sfc] = SnSk] {jyk,n 1, , , , , g). 

If Si is the identity of (?, write for [^i, Si]. Then 

~ [Sj Si] = [Si, Sj], T == 

Since this matrix Y involves g independent parameters y, the 
second part of Theorem 8 gives 

ei? + • • • + = g- 

Hence 

(ei - /i)2 + • ‘ + {ec ~ /c)^ = ^ - 2g + g = 0, 


so that Ct = ft for every L 

A matrix Z independent of the x’s which is commutative with 
X is of type 7. Since Y differs from X only in the order of mul- 
tiplication of subscripts, a matrix commutative with every Y 
is of type X. Hence Z is commutative with X and with every Y 
if and only if its element in the fth row and jth column is simulta- 
neously of the forms 

.3.-1. (z, j = 1, . . . , g). 

Define s by Si == s, s. Thus 0 * = 2 ;^ if and only ii t = SjS s,-^ and 
hence if s and t are conjugates in (?. By the final part of Theorem 
8, the number c of independent parameters in Z is the number of 
classes of conjugate elements of G, 

If possible, let there be an irreducible group matrix H of order h 
corresponding to G which is not equivalent to one of the c irre- 
ducible components Xi, — , X^ of the regular group matrix. 
The first part of Theorem 8 shows that in the group matrix {xi^j) 
having the irreducible components Xi, . , . , Xc, H, 
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of the functions a; ij- of Xsp . . . jO^s^arelinearly independent, whereas 
evidently not more than g functions of them are linearly independ- 
ent. This proves 

Theokem 11. Corresponding to a group G, the total number of 
irreducible group matrices no two of which are equivalent is equal to 
the number c of classes of conjugate elements of G, 

144. Group characters. Consider a representation of G — {si 
= 1, § 2 , . . . , Sff} as a group of linear transformations with the 
non-singular matrices Denote the trace (sum of the diagonal 
elements) of the latter by x(St). The set of g numbers xC^i), . . . , 
x{Sg) is called a character of G corresponding to this representation 
or to the group matrix Z = ^ belonging to it. 

For the 4-rowed group matrix X in the example in §140, M^i 
is the identity matrix, whence xM = 4; while Msg is obtained 
from X by taking X 2 = 1 , Xi = 0 {i 9 ^ 2), whence xM = 0. We 
saw that f 2 = 2 ; C 02 , where z = Xi + 0 C 2 -- x^ — If we employ 
this new group matrix (z) having a single element z, we obtain the 
new character x(si) = xM = 1, xM = x(s4) = - 1* 

Since equivalent matrices have the same trace, to equivalent 
representations or equivalent group matrices correspond the same 
character. By Theorem 11, there exist irreducible group matrices 
Xi, . . . , Xc, no two of which are equivalent, such that every 
irreducible group matrix is equivalent to one of them. The 
corresponding (simple) characters are denoted by xi(5t)>- • • 7Xc(st)* 

If among the irreducible components of the group matrix X, 
Xj occurs exactly r,* times (r,- ^ 0), evidently 

(11) x{s^) = n xl(s^) + \-r, Xc(s^). 

Let u and v be two distinct numbers of the set 1, ... c, and write 

= i: (a.-,a)a:R, ("’ I ‘ 

R-* R y - A, . . . , Jo,/ 
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Then 




In (6) replace i by m and k by /i, multiply the resulting equation 
by airr^ a^nT and sum for m, n = 1, . . . We get 


J J g 

O^imS (^knT O'nlR ~ ^ jn dimS O'kn'T- 

R m— 1 n==l J m, n 


The summand on the right is zero unless m = I, n = j. Since 
Aq Ar -1 = Asr- 1 , we may perform the summations on the left and 
obtain 

(12) ^ O'ijSR-^ ClklTR = 7 CLilS dkjT* 

R J 


From (7) we obtain in the same manner 
(13) ^USR"^ bpqTR ~ 0. 

R 


In (12) take j — i, I — k, T = 7; we get 


2^ ^tiSR-^ ClkkR = ~Ciik3 djfci. 
R J 


Sum for i and ft, and mark / with the subscript u. Thus 

(14) E X. iSR-^)x. (B) = 7 x» (-S). 

R Ju 

Since Xuil) = /vj the case S = I gives 

(15) Ex«(R-0x«(ii!) =( 7 . 
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Similarly, in (13) take j = q = p, T — I, and sum for i, p. 
We get 

(16) E (-Si2-I)x. (i?) =0 {U9^ v). 

R 

We return to the general group matrix X to which corresponds 
the character x{^%) for ^ = 1, . . . , g. Multiply (11) by Xu 
and sum for z = 1, . . . , g. We get 

0 CO 

E x(si)x«(s,~0 =E X!-(s0xu(si“^)- 

17=1 1=1 

For V 9 ^ u, the inner sum on the right is zero by (16) with S = I. 
For V — Uj it is g by (15). Hence Tu is determined by the values 
of x(®0 and the fixed numbers Xu(si^^). But Tj is the number 
of times X j- occurs as an irreducible component of X. This proves 

Theorem 12. Two representations of G as linear groups are 
equivalent if and only if their corresponding characters are the same, 

145 . Applications to group matrices. Let (? be a commutative 
group, and X = ^ Ma^ Xa^ be a corresponding irreducible group 
matrix of order/. Since M is commutative with X, it is of the form 
Ci 1/ by the corollary in §141. If is of order e^, ^ = If , whence 

Ci is an e^th. root of unity. If / > 1, X = ^ici Xafjlf would be a 
reducible matrix. 

Theorem 13. An irreducible group matrix corresponding to a 
commutative group is of order unity. 

See the example at the end of §140. 

It follows from Theorem 6 that every representation of a com- 
mutative group of order g is equivalent to one whose matrices 
have gth. roots of unity in the diagonal and zeros everywhere out- 
side the diagonal. This property therefore holds for the special 
case of a cyclic group and proves 
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Theoeem 14. The trace of any linear transformation of finite 
order is a sum of roots of unity. 

Theorem 15. The order of every irreducible group matrix of G 
is a divisor of the order g of G. 

Write jBs = 1 or 0 according as s == I or s 5 *^ 7. Then 

r x(5)i?sK-i = X(S), 

R 


SO that (14) may be written in the form 

z X(S){|S3E-1 - x(SE-o}= 0. 

Since X (7) = / ^ 0 , the determinant 


- x(SB-^) 


= 0 • 


(Sj R Slf . . . , Sgfi 


We saw that the trace % of SR~^ is a sum of roots of unity and 
hence is an integral algebraic number. Thus g/f is a root of an 
equation of degree g whose leading coefficient is unity and remain- 
ing coefficients are integral algebraic numbers. Hence it is an 
integral algebraic number. Being at the same time a rational 
number, it is a rational integer. 


146. The alternating group G on five letters. By Ex. 5, § 111 , G 
has exactly five sets of conjugate substitutions. Let the five repre- 
sentations of G as irreducible linear groups have the orders 1 , a, 
c, d. They are divisors of 60 and the sum of their squares is 60. 
Their maximum is evidently 6. If d = 6, -f- 6^ ^ ^2 -=23, 

which is impossible in integers. Let d == 5. Then -f* 6 ^ ^ ^2 
= 34. If c = 5 , + 52 = 9 Q,nd a or 6 is zero. If c = 4, a = 6 



§147] COMPUTATION OF GROUP CHARACTERS 269 

= 3. Not all of a, b, c are ^ 3. Finally, if a, 6, c, d are all S 4, not 
all are 4, while 1 + 4^ + 4^ + 4^ + 3^ = 58 < 60. 

Theoeem 16 . Every representation of the alternating group on 
five letters as an irreducible linear group is equivalent to one on 1, 3, 
4, or 5 variables^ there being two types on 3 variables. 

147. Computation of group characters. In (12) take k = j, 
I = and sum for i and j ; we get 

( 17 ) /Z xiSR-^ TR) = gx{S)xm- 

R 

Since the traces of Ag and Ag A^ are equal, 

(18) x{R8) = xim- 

Consider the symmetric group on 3 letters and write 

E = identity, A = (132), B = (123), C = (12), D = (13), 

F = (23). 

The three sets of conjugates are E; A, B; C, D, F. By (18), 

x(A) = x{B), xiC) = x{D) = x(F). 

By (17) for T = E, x(S)=/. By (17) for S = T = A- 
S = T = C ; S = A, T = C in turn, and by (15), we get 


3f + 3/ x(A) = &xHA), 2f + 4/ x(A) = Qx^(C), 
6/x(C) = 6x(A)x(C), P + 2xHA) + dx^C) = 6. 



270 


REPRESENTATION AS LINEAR GROUP [Ch. XIV 


If x{C) 5^ 0, the third relation gives x(-d) = /, and the other 
relations give x^iC) = P = 1. Since the order/ of the irreducible 
group matrix is positive, we have 

/=!, x(i?) = X(A) = 1, x(C') = ±l. 

If xCC) = 0, the conditions reduce to x(-4^) = ~ !/> /* = 4, 
whence 

/ = 2, xiE)=2, x(A) = -l, x(C) = 0. 
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